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PREFACE 


This report describes part of a comprehensive and continuing pro- 
gram of research concerned with advancing the state-of-the-art in 
remote sensing of the environment from aircraft and satellites. The 
research is being carried out for NASA's Lyndon B. Johnson Space 
Center (JSC), Houston, Texas, by the Environmental Research Institute 
of Michigan (ERIM) . The basic objective of this multidisciplinary 
program is to develop remote sensing as a practical tool to provide 
the planner and decision-maker with extensive information quickly and 
economically. 

Timely information obtained by remote sensing can be important to 
such people -as the farmer, the city planner, the conservationist, and 
others concerned with problems such as crop yield and disease, urban 
land studies and development, water pollution, and forest management. 

i 

The scope of our program includes : 

1. - Extending the understanding of basic processes. 

2. Discovering net-? applications, developing advanced remote- 
sensing systems, and improving automatic data processing 
to extract information in a useful form. 

3. Assisting in data collection, processing, analysis, and 
ground-truth verification. . 

The research described herein was performed under NASA Contract 
NAS9-15476 and covers the period from November 15, 1978 through 
November 14, 1979- I. Dale Browne/SF3 was the NASA Contract Technical 
Monitor and Thomas Pendleton/SF3 was the primary NASA Technical Coordi- 
nator of the activity. The program was directed at ERIM by Richard 
R. Legault, Vice President and Head of the Infrared and Optics Division, 
Quentin A. Holmes, Program Manager, and Robert Horvath, Head of the 
Analysis Department. 
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APPENDIX A 

EXTRACTION OF TEMPORAL-SPECTRAL FEATURES 

A.l UTILITY OF TEMPORAL-SPECTRAL FEATURES 

A. 1.1 BACKGROUND 

Assignment of" crop type or crop group labels to a set of labeling 
targets (pixels, fields, etc) often involves recognition of a temporal- 
spectral pattern that is characteristic of "a crop or crop group. Since 
the labeling targets consist of' populations of plants rather than a 
single plant, there is likely to be some variation in stage of develop- 
ment within the target area at any given, point in time. As a result, 
crop development and the accompanying spectral development at the target 
level will, in most cases, appear to be a continuous rather than a dis- 
crete process. As in most other biological population phenomena, these 
development patterns could be expected to conform to a Sigmoid curve 
(see Figure A-l) . 

Based on these assumptions, one can conclude that the discrete 
observations obtained from Landsat are samples from a continuous pattern 
of spectral development. Characterization of that pattern,, based on the 
given set of samples, should allow more complete description of the tar- 
get and, as a result, more accurate labeling. The term ’profile’ is used 
to describe the mathematical representation of this developmenf.pattern. 

A. 1.2 LEVELS OF USE 

Representations of the temporal-spectral development of crops, 
fields, etc may be used in a variety of ways. Since the degree of 
precision demanded of the profiles varies with intended use, it is 
of value to define a set of levels or categories of application. Models 
ma!y then be developed and evaluated at the level of use for which they 


1 





are intended, and extended to other levels (particularly higher or more 
demanding levels) only after careful re-testing and, perhaps, modifi- 
cation. 

Level 1 ; At the first level, a profile need only provide a 
stylized representation of the crop development pattern. Small devia- 
tions from the norm that might be observed on a target level needn't 
be accounted for; a relatively simple model, even one that only con- 
nects observations with a series of line segments, may suffice. 

Level 2 ; Estimation of a particular feature of crop spectral 
development requires a more accurate profile fit, at least in that 
portion of the profile from vhich the feature of interest is derived. 
Thus, a Level 2 model must, on the whole, be more accurate and flexible 
than one which could be used at Level 1. 

Level 3 : The next level of application involves characterization 

of overall crop development and multiple features. Models at this 
level must be able to accurately portray spectral development 
throughout the growing season, and as _ such must be even more flexi- 
ble and accurate than at the previous levels. This level, which is 
the focus of the study reported here, will be more fully discussed 
in Section A. 1.4. 

Level 4 ; The final level in the sequence involves a still greater 
accuracy requirement. If a profile can be fit to a set of observations 
accurately enough, and the interpolation and/or extrapolation of pro- 
file values can be carried out wisely enough, then it may be possible 
to use the profile values themselves as data, replacing or augmenting 
the Landsat observations. While precise mathematical fitting is 
important here, the major need is understanding of the physical and 
biological processes taking place and their impact on spectral char- 
acteristics, and incorporation of that understanding into the fitting 
process. 
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A. 1.3 PAST APPLICATION OF PROFILE TECHNOLOGY 

Multitemporal characterizations of spectral development have been' 
used previously at both Levels 1 and 2. Most of the work has involved 
spring small grains and Tasseled-Cap Greenness, and utilized a model 
developed at ERIM. This model is of the form 

2 

F(t) = at^e^*^ (A-1) 


where F(t) - Greenness - 25, 

t = shifted day of year - 125, 
a,b,c = model parameters. 

This form is a smooth curve that can fit a series of observations of 
spring small grains targets. This model was originally applied at • 
ERIM to crop calendar shift estimation [ 18 ] in a refinement of a 
technique developed by G. D. Badhwar [ 16 ] in which a simpler pro- 
file form was used. 

More recently, the same model has been used to describe particular 
features of spectral development. Appendix G describes a machine 
labeling technique for distinguishing Spring Wheat and Barley that 
employs the maximum value of the profile as an indicator of moisture 
stress conditions on a sample-segment level. A similar application 
has been carried out by UCB in the context of detection of episodal 
events [ '^5 ] , 

Profile technology, and the described model form, have also been 
used as a basis for a classifier both of spring small grains and 
corn [46]. 
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A. 1.4 LEVEL 3 PROFILE APPLICATION 

Figure A-2 illustrates some of the potentially useful profile- 
based features which could be extracted at Level 3. The set of fea- 
tures in toto provides a' comprehensive description of the target 
spectral development pattern. The maximum value of the profile can 
serve as an indicator of crop vigor and percent cover (asstiming a 
vegetation Indicator such as Tasseled-Cap Greenness is used) . The 
rates of Greenness increase and decrease, and changes- in those rates, 
can serve as useful features for distingusihing crops whose profiles 
have similar overall shapes' but which develop differently at some 
points in the season. These same rates can also offer information 
on crop condition. Similarly, the total development time (the inter- 
val between departure from and subsequent return to some nominal base 
value) or half-amplitude Interval could provide information relevant 
to both crop identification and condition assessment. 

The set of features, perhaps with additional profile-derived 
features, should provide a more cohesive description of the target, 
allowing broader Inferences to be made relativ^v.to both labeling 
and, where the two are separate, assessment of crop vigor. 

A. 1.5 PURPOSE OF CURRENT INVESTIGATION 

The work reported herein was undertaken for two major reasons. 
First, the promising aspects of Level 3 application (overall develop- 
ment characterization) prompted an interest in determining whether 
the model currently in use and developed for Level 1 or 2 use, could 
be extended into the arena of Level 3 application. Second, it was 
deemed important to study the various steps in the profile-fitting 
process itself and to define a procedure that could serve to standard- 
ize application of the technology in the community. 
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A. 2 STUDY OF PROFILE-FITTING FOR OVERALL CROP DEVELOPMENT CHARACTERIZATION 

A. 2.1 GENERAL STEPS OF A FITTING PROCEDURE 

Any procedure by which a profile model is fit to a set of data 
points would likely include the following steps: 

a. Data Selection : The target to which a profile is to be fit 

must be specified. This target may be individual pixels, 
all pixels of a field or quasi-field, or the mean of the 
field or quasi-field. 

b. Data Preparation : The Landsat data may require several steps 

of preprocessing to arrive at a standardized, normalized set 
of observations that can be converted into a smooth profile. . 

c. Parameter Estimation : Once the data is prepared, the values 

of the profile parameters must be estimated. 

These steps will be treated separately in the following description 
of study approach and results, with the data selection step being dis- 
cussed after the other two steps. 

A. 2. 2 DATA PREPARATION 

In order to utilize the information obtained by fitting a profile 
to a set of observations, to obtain meaningful information about overall 
crop development, it is necessary that the influence of external 
phenomena on target spectral appearance be reduced to the greatest 
possible degree. Variations in profile shape must be tied to crop 
type or condition and not to sun angle, haze condition, etc. 
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The series of normalization steps includes: 

a. Sensor Calibration (e.g., Landsat 2 to Landsat 3). 

b. Sun Angle Correction (cosine). 

c. Screening to flag clouds, shadows, etc. 

d. Haze Correction (spatially varying XSTAR) [37]. 

A second stage of data preparation is estimation of crop calendar 
shift. This procedure standardizes data to an arbitrary but common 
time scale, thus reducing signal variation on any given day which is 
the result not of crop appearance differences per se but rather of 
differences in stage of development at the time of observation. This 
process not only reduces signal variation on any given day, but pro- 
vides, in the case of fitting to all pixels of a’ field, added informa- 
tion that can be utilized in fitting the profile. 

The final stage of data preparation involves translation of the 
data axes such that the origin approximately corresponds to the start 
of Greenness development of the small-grain target. The need for this 
translation or offsetting stems from the model form itself. Regard- 
less of the actual starting Greenness value or starting time of Green- 
ness development, the model will consider all times from t=0 through 
some maximum time, and F(t)=0 through some maximum Greenness. The 
desired Sigmoid shape of each side of the -profile similarly occurs in 
the range from F(t)=0 through the maximum value. Figure A-3 illus- 
trates the effect of omitting this step. Despite the fact that the 
first Greenness observations occur well away from the untranslated 
origin, the model form begins there, and requires parameters that 
allow for a" long, relatively flat tail followed by a relatively rapid 
increase and decrease in the data range. Figure A-4 illustrates the 
same data fit with offset applied. Comparison of the two figures 
clearly points out the model's inability to accommodate both the long 
tail and the comparatively short span of actual crop development. 
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Day of Year 


FIGURE A-4. COMPARISON OF PROFILE FIT WITH AND WITHOUT OFFSET 

(Segment 1663 Field 7) 
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The resulting profile is significantly less accurate in overall 
development characterization than that computed using offset data. 
Variations in the method of parameter estimation, to be discussed 
next, can reduce this negative impact, but do not eliminate it. 

Base on data from several segments, offsets of 125 days and 25 
counts of Greenness (after adding 32 to all Tasseled-Cap channels) 
have been used. 

A. 2.3 PARAMETER ESTIMATION 

Linear vs . Non-Linear Techniques ; The method used to arrive at 
model parameter estimates has, in the past, exploited the ability to 
linearize the profile model. A logarithmic transformation produces 
the following linear model from:' 

In E(t) = In a + b In t + ct^ (A-2) 

Multiple linear .regression can then be applied to produce least squares 
estimates of the model parameters. 

While this method is simple and inexpensive, it has at least two 
disadvantages. First, the non-linear nature of the logarithmic trans- 
formation results in a distortion of the original data, and most 
importantly a compression of the peak of the set of observations. The 
least squares fit to these compressed data will, as a result, under- 
estimate the peak of the untransformed data. 

The problem is compounded by another aspect of the linear esti- 
mation technique. The least squares estimate in log space minimizes 
the quantity 


2 (In F(t) - In.G^(t))^ 


(A-3) 
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where F(t) = profile value, 

G^(t) = data value. 

Since the difference of two logarithms is the logarithm of the 
ratio of the two quantities, the quantity being minimized is, in fact. 

Since a given number of counts difference results in a larger ratio 
when the profile and data values are small than when they are larger, 
the linear least squares estimation procedure is giving greater weight 
to the tails of the profile in determining the best fit. Fit at the 
peak will be sacrificed, to some degree, in order to improve fit at 
the lower values. Not only does this accentuate the low peak estimate 
problem, it gives least Importance to those parts of the profile (at 
and around the peak) that in many applications will be the most 
important. 

An alternative method of parameter estimation involves a non- 
linear least squares technique. A routine from the IMSL package [47] 
was chosen which applies a modified Levenberg-Marquardt steepest 
descent algorithm to seek out the minimum value of a residual sum 
of squares surface through Iterative estimation and evaluation of 
parameters. This technique avoids the need for a log transformation 
and thus the technical problem associated with that transformation. 

Comparison of the two methods required definition of a common 

2 

measure of goodness-of-f it since the R from the linear regression 
is a measure of fit in log space rather than actual data space. As 
such, the following was used for comparison: 

S (F(t) - G^(t))^ 

Goodness of Fit = 1 — (A-5) 

Z (G^(tO - Gy 
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where F(t) = profile value, 

G^(t) = ith data value, 

'C = mean of all data values, 
and the range of the equation is [0,1]. 

Figure A— 5 and Table A-1 illustrate the results of the comparison. 
In all cases the non-linear technique provides an improved estimate 
of profile parameters, as evidenced both visually and empirically. 

Offsetting Problems Encountered ; In the course of the parameter 
estimation study, an apparent problem related to data offsetting (see 
Section A. 2.2) was encountered. Figure A-6 illustrates the problem, 
which appears to be the result of incorrect offset values. In both 
fields shown. Greenness values at shifted day 125 (offset day 0) are 
well above 25 (offset value 0) . While the result varies with the 
parameter estimation technique, the impact in both is a clear reduc- 
tion in profile accuracy. It is apparent, then, that the offset 
values used as standards are not appropriate in all cases. 

Several approaches to field-specific offset determination were 
considered. First was an approach that assumes a constant Greenness 
value before crop development begins. A straight line is drawn 
through a pair of pre-peak observations and the intersection of the 
line with the constant value is assumed to be the start-up time for 
crop spectral development. The major drawback of this approach is 
it’s inability to address the non-linear character of Greenness 
development. Figure A-7 illustrates this problem using simulated 
data. The lines drawn represent the results of using different 
18-day pairs of data points along the profile. While many do seem 
to point to the same starting date, those that include an observa- 
tion near the peak significantly mis-estimate the start-up time. 
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TABLE A-1. COMPAI^ISON OF GOODNESS-OF-FIT OF 
LINEAR VS, NON-LINEAR ESTIMATi^S 


Goodness-of-Fit 


Segment 

Field 

Linear 

Non-Linear 

1663- 

5 

.946 

.959 


7 

.919 

.932 


11 

.933 

.960 


. 12 

.920 

.929 


■13 

.914 

.941 


14 

.953 

.963 

1669 

28 

.819 

.834 


30 

.774 

.860 

1929 

16 

.883 

.890 


17 

.574 

.709 


23 

.725 

.842 


24 

. .886 

.923 
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FIGURE A-6. RESULTS OF INCORRECT OFFSETTING 
(Segment 1929 Field 23) 
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This result is further illustrated in Figures A-8 and A-9, using shifted 
and unshifted actual data. 

A second approach considered was inclusion of the offsets in 
model parameters to be estimated. The least squares fit would then 
be driven by five parameters instead of three. Attempts to apply the 
method, however, were unsuccessful, apparently due to parameter inter- 
actions . 


Finally, an attempt was made to determine field-specific offsets 
by iteratively calculating profiles while changing offset, and selec- 
ting offsets based on the goodness-of-fit measure. However, as 
illustrated' in Figure A-10, successive parameter estimates with a wide 
range of day offsets produced no clear optimum fit. 

An alternative to target-specific offset determination is develop- 
ment of a model that is less constrained by offset requirements. 

Development of a New Model : In response to the need for less 

dependence on offsetting, a new model was developed of the form: 


b (t-t ) 

ae ; t < t^ 

ae ^ P ; t > t 


where F(t) = Greenness-25 

t = shifted day of year, 

t = reference day of peak Greenness, 
P 

a,b^,b 2 = model parameters, 
and furthermore, 

a + 25 = estimated peak Greenness. 


(A-6) 


The model is fit to observations before and after the peak inde- 
pendently, while maintaining continuity (to the first derivative) at 
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the peak. The "zero-point" for time is now the time of peak Greenness, 
which is a relatively stable quantity after crop calendar shift esti- 
mation. 

Since the time value enters into the equation only as (t-t^) , 
the offset is taken care of automatically, and only crop calendar shift 
must be estimated beforehand. Further, while an offset in Greenness 
is still desirable, the elimination of a need for a (0,0) starting 
point also reduces the requirement for precision of the Greenness 
offset. 

The two models were compared, using the non-linear estimation pro- 
cedure, using a number of fields from several segments. In most cases, 
the new model provided a more accurate fit than the original model. 

In one segment, Segment 1663, many of the fields exhibited a near- 
linear increase in Greenness early in the season. The new model, 
which tends to produce a more distinct Sigmoid shape both before and 
after the peak, proved unable to fit the early acquisitions in many 
of the fields tested in Segment 1663 (Figure A-11). However, the fit 
over the remainder of the growing season was accurate. 

The -improvement with the new model is most distinct on those 
fields for which offsets were clearly a problem. Figures A-12 and 
A-13 demonstrate the improvement in fit with the new model. 

An additional advantage of the new model form is its flexibility. 
Since the model is fit in a piece-wise fashion, i.e. , before peak and 
after peak, it is able to adapt to a wider range of profile shapes. 
Figure A-14 illustrates some of the possible forms achievable with the 
model. 
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FIGURE A-13. COMPARISON OF MODEL FORMS 
(Segment 1929 Field 23) 






A. 2. '4 DATA SELECTION 

The final aspect of profile fitting to be addressed, and the first 
step in the procedure, is data selection. Two alternatives were con- 
sidered: field interior pixels and field means. Most of the tests, 

however, are applicable to other possible targets; individual pixels 
should be much like field means, and quasi-fields should correspond to 
fields, either means or pixels. It should first be noted that in 
research efforts where field boundaries are determined by ground truth, 
mis-registration will likely require that a one-pixel inset be applied 
at the field edges in order to get a good field signature. 

Application of the non-linear fitting technique with either model 
to field means resulted, in most cases, in a profile that appeared 
qualitatively the same as one produced using all interior pixels. 

Two problems, however, are apparent at least on a theoretical level. 

First, estimation of crop calendar shift, followed by estimation 
of three model parameters, computationally requires .at least three 
independent acquisitions in the range of the profile (more than three 
are desirable). Assuming a growing season of approximately 90 days, 
and data from only one satellite (i.e., every 18 days), five is the 
maximum number of observations possible. While two satellites increase 
the number potentially available to ten, the loss of a few acquisitions 
due to cloud cover could reduce the total available to a critical level. 

In addition, in large fields and to a lesser degree in nearly all 
fields, there can be expected to be some variation in stage of develop- 
ment within the field. Where this .is the case, shifting individual 
pixels will provide additional information which can be used to better 
characterize the overall shape of the profile. Thus, one could expect 
some degree of information loss when using field means. However, since 
the amount of information lost is unlikely to be critical, by far the 
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more serious concern is the number of observations available for esti- 
mation purposes. If indeed sufficient acquisitions are available, 
using field Interior means in place of field Interior pixels should 
be acceptable. 

A. 3 PROCEDURE FOR FITTING PROFILES TO TARGET DATA FOR LEVEL 3 

applications 

A. 3.1 DATA SELECTION 

Utilize targets defined by field or quasi-field boundaries, with 
one-pixel insets at borders to account for misregistration. If suf- 
ficient acquisitions are available (at least five) , means may be used 
in place of Interior pixels. 


A. 3. 2 DATA PREPARATION 

The following steps are prescribed for normalizing and standardizing 

data: 

• Satellite calibration. correction (see Appendix H). 

• Cosine sun-angle correction. 

• Cloud, shadow, etc detection [3 3* 

• Spatially- varying XSTAR haze correction [3 ] . 

• Estimation of drop calendar shift [ 3 ]. 

• Offsetting of Greenness values by 25 counts . 

A. 3. 3 PARAMETER ESTIMATION 

Estimation of parameters for the new model (Equation A- 6 ) should 
be carried out using a non-linear least squares technique. 


28 




APPENDIX B 

ANALYSES. OF SPECTRAL-TEMPORAL CHARACTERISTICS OF WHEAT 
B.l INTRODUCTION 

The overall objectives of the analyses described in this appendix 
were twofold. The first was to develop discriminative features from 
spectral data acquired for wheat throughout the growing season. The 
second was to develop an improved understanding of the spectral charac- 
teristics of wheat as functions of the spectral space employed and stage 
of development . 

This appendix has four major topics. Section B.2 addresses methods for 
analyzing and interpreting patterns in reflectance data in terms of 
corresponding features in Landsat data. Section B.3 is characteristics 
of spectral measures of wheat green development, comparing the selected 
Greenness feature with other measures such as spectral band ratios. 

'The Section B.4 topic is effects of moisture stress and Section B.5 is the 
estimation of crop development stage from temporal-spectral profiles 
of Greenness and crop calendar shift calculations. 

The data sets analyzed include a sequence of field measurements 
of wheat reflectance made by USDA personnel in Phoenix, Arizona, [48] 

and several North Dakota segments from the LACIE Transition Year 1978 

* 

Landsat data base. Table B-1 further describes these data sets. 

B.2 INTERPRETATION OF REFLECTANCE DATA 

It is well established that the majority of variability in Landsat 
data is confined to two dimensions conveniently described by the 
Brightness-Greenness plane of the Tasseled-Cap Transformation. Pt was 

desired to analyze inband reflectance measurements „in a way that corre- 
sponds to that transformation. In previous work [’49] , regression 
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TABLE B-1. DESCRIPTION OF DATA SETS 


A. FIELD-MEASURED REFLECTANCES (Dr. R. Jackson, et al, USDA) 

Site: U.S. Water Conservation Laboratory, Phoenix, Arizona 

Date: 1977-1978 growing season 

Experiment Obj active : 

Determine water stress effects on wheat yield and 
spectral characteristics 

Experiment Factors: 

Moisture treatment - six levels of irrigation 

Crop - Two spring wheat varieties and one barley 
(Planting densities greater than normal 
for Northern U.S. Great Plains) 

Spectral Measurements : Hand-held Landsat-band radiometer 

Frequency: Every other day, weather permitting 

Agronomic Measurements : 

Crop development stage (modified Feekes scale) 

Leaf area 

Stem length 

Wet and dry weights 


B. LAND SAT DATA 

Sites: North Dakota Segments 1392, 1457, 1461, and 1636 

Date: 1978 growing season (See Table B-2 for listing of 

individual acquisition dates) 

Processing Applied: 

Calibration adjustments for Landsat 3 to Landsat 2 
Sun angle correction 

Haze correction using spatially varying XSTAR algorithm 
Tasseled-Cap transformation 
Selection of field interior pixels 
Computation of spectral means for each field 
• Calculation of day shifts using spectral means 

Agronomic Measurements: 

Crop development stage (modified Feekes scale) 
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relationships were established between Landsat values and field-measured 
(by helicopter-borne spectroradiometer) reflectances for individual 
fields on several dates. Also, a principal component analysis was con- 
ducted of inband reflectances. Ninety-nine or more percent of the varia- 
bility was found to lie in a plane and a Tasseled-Cap-like transformation 
was defined by determining two directions, one visually aligned with 
bare soil data in the principal plane and the other orthogonal to it in 
the direction of green vegetation reflectance values. 

A modified approach was followed this year. The principal component 
analysis step was repeated, but a different procedure was used to estab- 
lish the soil line. All bare soil data were isolated and subjected to 
second principal component analysis; the major component of this analysis 
was then defined to be the brightness direction, with an orthogonal 
greenness dimension. In addition, the Inband reflectance values were 
first multiplied by band-to-band calibration ratios based on the cali- 
bration values of Landsat. The objective was to provide a weighting of 
values in 'the various reflectance bands that was more comparable to the 
weighting applied by Landsat (for instance the range of MSS7 data is 
half that of the other bands) . 

It would be desirable in the future to re-do the regression-type 
analysis of Reference [49] using data from other sites. One reason is 
that the prior study did not use the spatially varying XSTAR algorithm. 
Another is that uncertainties exist about the calibration of those early 
helicopter-borne reflectance measurements. 

As with the earlier principal component analyses, 99% or so of the 
variability in the data was found to lie in the principal plane. Thus, 
Indications are that senescing vegetation does riot lie out of this plane 
by any significant amount. 
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B.3 MEASURES OF GREEN VEGETATION DEVELOPMENT 

A number of Landsat-derived green measures have been proposed and 
used by various investigators. Along with Tasseled-Cap Greenness, the 
list includes MSS7/MSS5, v1iSS7 /MSS5 [ 50 ], Green Angle [ 51 ], Vegetation 
Index (VI) [ 52 ] > Transformed VI (TVI) [52lj among others. We ex- 
plored the characteristics of these measures using the set of field- 
measured reflectance data. 

Several criteria are appropriate for selecting a green measure for 
a particular application. These include the shape of its time profile, 
its stability, its correlation with agronomic variables, and its useful- 
ness for discrimination. The shape is an important factor for use in 
crop calendar shift operations. For instance, ease of fitting a mathe- 
matical form to the profile shape and representation of the overall 
development process are important. For crop development stage estima- 
tion, emphasis of particular development stages can be important. As 
far as stability is concerned, it is desirable to have both a low vari- 
ance and a low sensitivity to selected other factors, such as soil color. 

Figures B-1 through B-3 present spectral-temporal profiles for the 
different variables. These profiles were obtained from smoothed reflec- 
tance measurements from one plot of data. With cautions that the shapes 
might be different in Landsat data due to path radiance effects and that 
only one test plot is represented, a few observations can be made. While 
there are similarities in the shapes, there also are differences. The 
peaks of some, especially TVI, are broad and relatively flat. A broad 
flat peak and steep edges is not the ideal shape for shift calculations; 
a triangular profile would be better. The peaks of most occur later than 
that for Greenness . 
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The difference in day of peak for Greenness and Band 7/Band 5 is 
interesting. Insight into the reasons for this can he gained from a 
plot of the time track of wheat in the plane of Band 7 vs. Band 5 (See 
figure B-4). The peak in Greenness corresponds to the peak in Band 7 
which occurred at growth stage 9 or 10 which is just prior to heading 
(See definition of modified Feekes scale in Figure B-5) . The Band 7/ 
Band 5 ratio, on the other hand, reaches its peak when a radial line 
through the origin is tangent to the left hand side of the loop; this 
occurred near the end of heading. 

It appears that no single green measure will be optimum for all 
purposes. Landsat data will be needed for final characterization and 
selection for specific applications. We chose the Greenness variable 
as the measure of green vegetation for the investigations conducted 
during the year- 

B.4 EFFECTS OF MOISTURE STRESS 

The time track in Figure B-4 is for a wheat field that had a dry 
moisture treatment. Figure B-6 is for a field that had a wet treatment. 
While both exhibit the looping pattern, the peak value of Band 7 is 
higher for the wet treatment which had more dense vegetation. The 
differences were even more pronounced for fields that received more 
extreme moisture treatments. The wettest field suffered from lodging 
which caused a marked change in its spectral track. 

The Tasseled-Cap-like transformation performs primarily a rotation 
of the time-track pattern, preserving its distinctive features. Figure 
B-7 shows the reflectance-space Greenness vs. Brightness plot for the 
field shown in Figure B-4. 
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FIGURE B-4. TIME TRACK OF WHEAT REFLECTANCE: PLOT 6A DRY TREATMENT 
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FIGURE B-6. TIME TRACK OF WHEAT REFLECTANCE: PLOT 3A WET TREATMENT 
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FIGURE B-7. TASSELED-CAP-EQUIVALENT SPECTRAL TRAJECTORY 
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B.5 ESTIMATION OF CROP DEVELOPMENT STAGE 

The spectral time track patterns in the preceding figures show a 
strong correlation with the stage of crop development. We see that 
peak Greenness occurred just prior to heading, during development of 
the flag leaf. Reflectances generally decreased proportionally toward 
the origin during heading to the milky-ripe stage (11.1) of ripening, 
perhaps due to the opaqueness of heads -resulting in increased shadowing. 
As ripening progressed, the Brightness began to increase again and 
Greenness continued to decline (See Figure B-8). 

Rates of wheat development can vary substantially from location 
to location. Figure B-9 presents a comparison of selected ground- 
measured development profiles for spring wheat grown in Arizona- and 
North Dakota. It is such profiles that one wishes to estimate using 
spectral data. Some of the problems that must be overcome are evident 
on this graph. First, the Modified Feekes Scale is not linearly related 
to calendar date; it is compressed at the high end which corresponds to 
heading and ripening stages. Geographic factors can have an effect, 
although the planting of spring wheat in mid-December in Arizona is an 
extreme departure from the more common spring planting in the Northern 
D.S. Great Plains. The characterization of North Dakota profiles by 
'the 18-day observations is less complete than desirable, but that is 
the usual interval that Landsat will provide under cloud-free conditions. 

Underlying both types of curves are uncertainties and variability in 
ground observations for the fields of interest; some Feekes-scale desig- 
nations are not readily discernible without detailed examination and 
within-field variations do exist. 

Plots of development stage vs. day of year are presented for 13 
fields of North Dakota Segment 1461 in Figure B-10 for which periodic 
ground observations of development stage were made. The scatter of 
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points before shifting (Part a) appears’ greater than afterwards (Part 
b) , especially in the portion corresponding to unshifted day 173. Cal- 
culated crop calendar shifts for these fields were used to place the 
estimated peak Greenness value at shifted day 160 in Part b. In field 
measurements data, this peak usually occurs just prior to heading, e.g., 
at or just before Stage 10 (boot). Days shifts calculated for other 
segments are summarized in Table B— 2. 

An appreciation for the effectiveness of the crop calendar shift 
calculation can be gained from Figure B-11. Here, the reduced scatter 
of the spectral data after shift is striking. Less pronounced improve- 
ments were achieved for two other segments. 

This initial work in crop development stage estimation did not 
reach a conclusive stage. However, some issues were identified that 
could affect future work. It is recommended that ground observations 
of crop development stage in selected fields in segments be made more 
frequently than at 18-day intervals , even though spectral data may not 
be acquired more frequently. The sparseness of Landsat data, especially 
with missing acquisitions will limit accuracy. The use of green profile 
shift technology appears helpful but improvements involving more complete 
characterization of profiles (e.g., level 2 or level 3 applications) 
and/or use of other spectral features, e.g., Brightness, may be required. 
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TABLE B-2. ANALYZED SUBSET OF TY 1978 SPRING WHEAT BLIND SITES: 
LAND SAT DATA (BOTH LAND SAT 2 AND 3) 



p- 
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Acquisition Days 
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Shi^t 
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APPENDIX C 

ANALYST LABELING/PROCEDURE M EXPERIMENT 

The analyst labeling/Procedure M experiment represents a natural 
continuation of the Procedure M experiment reported in Reference S . 

The techniques of testing and evaluating both Procedure M and its com- 
ponents have been refined. The use of analyst labels in this experi- 
ment represents an increase in scope over last year's Procedure M 
experiment. Section 4.9 of Volume I summarizes the results of the 
analyst labeling study and Section 6.2 describes tests of components 
of Procedure M. This Appendix provides details of the experiment for 
completeness . 

C.l PURPOSE AND SCOPE OF EXPERIMENT 

1. Gain understanding of analyst labeling of field-like targets. 

a. Examine the use of field-like targets for labeling. 

b. Evaluate individual analyst performance in labeling 
targets as functions of : 

• target level variables, 

• spectral trajectory strata, and 

• segment-level variables. 

c. Evaluate joint analyst performance in labeling 
targets . 

9 Examination of analyst consistency. 

• Comparison of individual analyst performance to 

average, vote, and consensus labeling performance. 

• Evaluation of low confidence and mixed target labels. 

d. Evaluate analyst performance and consistency in 
estimating crop proportions. 

e. Establish a data base of AI labels for evaluation of 
Procedure M. 
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2. Test and evaluate Procedure M. 

a. Estimation of performance parameters. 

b. Evaluation of the components of Procedure M. 

c. Study of error propagation by the components of 
Procedure M. 

C.2 DESCRIPTION OF THE EXPERIMENT 

C.2.1 PRELABELING PROCESSING OF 18 TY 1978 NORTHERN GREAT PLAINS 
LACIE SEGMENTS 

1. Sun angle correction. 

2. Machine screening. 

3. Satellite calibration. 

4. Haze correction by spatially varying XSTAR. 

5. Tasseled-Cap data transformation. 

6. Clustering pixels which are close to one another spectrally 

(Brightness- Greenness) and spatially (line-point) into field- 
like forms called blobs. 

7. Construction of 117 x 196 line printer maps such that: 

e Pixels from little blobs (those with no interior pixels) 
are printed as This choice of symbol was a mistake; 

the analysts had much to say about it. 

• Pixels on the boundary of big blobs (those blobs with at 
least one interior pixels) were printed with a unique 
character . 

» Pixels in the interior of big blobs were printed with a 
blank. 

• Production by JSC of PFC image overlays in which all 
pixels not in the interior of some big blob were blacked 
out. 

C.2.2 LABELING OF TARGETS 

1. Sampling of big blobs : Blobs which have at least one interior 

pixel are called big blobs. These big blobs are currently candidate 
labeling targets subject to being sampled. In an operational mode. 
Procedure M would sample on the order of 100 big blobs from each segment. 
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In order to investigate various sampling strategies and to investigate 
various analyst labeling attributes, all big blobs were labeled by 
three LACIE-experienced analysts . 

2. Prelabeling training/ labeling procedure adjustment : In order 

to study various parameters associated with error models and analyst 
consistency, the analysts were asked to work independently. No spec- 
tral aids were available in the time frame in which we were conducting 
this experiment. In order to adjust for the difference in labeling 
targets, lacks of spectral aids, and the Independent labeling, the 
analysts initially were given about one day to discuss labeling problems 
and procedures to be used in blob labeling. Segment 1392 was studied 
during this session, but no labeling was performed at that time. Seg- 
ment 1392 was the last segment labeled by the analysts. The analysts 
later were asked to list any segments in which they had to depart from 
this procedure which they developed. All analysis techniques (i.e., 
image interpretation), aside from those differences mentioned above 
were to be the same as those used during LACIE TY operations. 

3. Labeling procedure for Procedure M test ; The procedural 
steps for labeling blobs for the Procedure M. experiment follow. 

a. Each analyst will label the blobs in the 18 TY segments 
in the order given in Table C-1. 

b. Labeling will be performed independently by each analyst. 

c. Each analyst will relabel the first three segments after 
all 18 segments have been labeled (see Table C-1) . 

d. After labeling each segment, the analyst will fill out 
the ’’segment comment forms" shown in Figure C-1. 

e. After labeling all 18 segments and prior to relabeling 
the first three segments, the analyst will complete 
the "final comment form" shown in Figure C-2. 
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TABLE C-1. 

Analyst 

1656 

1825 

1457 

1461 

1619 

1602 

1650 

1566 

1473 

1653 

1835 

1636 

1612 

1380' 

1467 

1920 

1518- 

1392 


THREE RANDOM PERMUTATIONS OF 18 SEGMENTS 


Analyst B Analyst C 


1636 

1653 

1619 

1612 

1518 

1650 

1602 

1602 

1650 

1457 

1653 

1656 

1825 

1380 

1612 

1920 

1461 

1835 

1380 

1636 

1566 

1518 

1457 

1473 

1656 

1467 

1467 

1619 

1920 

1825 

1473 

1566 

1835 

1461 

1392 

1392 
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(Use other side if needed) 

B. What are the deficiencies, if any, in the available acquisition 
history? 

6. Comments About the Blob Patterns: 

A. Do the blob interiors seem to be pure? 

B. Do the blob patterns match the field patterns? 

C. Was the choice of blob acquisition optimal? If not, why? 

Which acquisitions should be used? 

D. Other comments about blob patterns: 

7. General Description of Segment. For example, moisture, field size, 
topography, percent agriculture, etc. 

8. Did you have to change your procedure in order to handle this segment? 
If so, describe how. 


9. Other Comments: 


FIGURE C-1. SEGMENT COMMENT FORM FOR PROCEDURE M TEST 
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1. Analyst's Code; 


2. Analyst's Background and Years of Experience in LACIE: 


3. Analyst's Overall Impression of Blob Interiors as Labeling Targets 
(Ease of Labeling, Purity, Ease of Finding, Comparison with Dot 
Labeling) : 


4. List the Major Problems and Strong Points of Blob Labeling: 


5. Comments on Suitability of Image and Map Products: 


6. ^^hat Procedures Did You, Develop to Organize and Use the Product? 


7. Recommendations for Improvements in Products, Procedures, etc; 


FIGURE C-2. FINAL COMMENT FORM FOR PROCEDURE M TEST 
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Detailed blob labeling instructions are given below. 

• For each segment, blob overlays keyed to the 
LACIE sample segment products (Products 1, 2, 

and 3) have been generated as well as line printer 
maps of the blobs for each segment. The line 
printer maps will be used by the analyst to record 
the label for each blob. 

• A blob is to be labeled spring small grains if it 
is at least 50% spring small grains, and labeled 
non-spring small grains if it is less than 50% 
spring small grains. When there is considerable 
question about which label to assign a blob, the 
blob will be labeled based upon the analyst's best 
guess. If a blob is mixed (i.e., composed of 
approximately 50% spring small grains and non- 
spring small grains) it will be flagged and 
labeled on the line printer map . 

• The label for each category will be coded on the 
line printer map as follows: 

- Spring Small Grains - Red. 

- Non-Spring Small Grains - Green. 

- Questionable Spring Small Grains - Question 
Mark (?) on Red blob. 

- Questionable Non-Spring Small Grains - Question 
Mark (?) on Green blob. 

- Mixed blob - The blob will be labeled using the 
50% criterion and then outlined in blue. 

• The analyst should examine the acquisition listing 
in Table C-2 in order to become familiar with the 
acquisitions used for generating the blobs. 

• Grid overlays (10 pixels by 10 scan lines) will be 
keyed to the line printer map in order to facili- 
tate analysis. 



TABLE C-2. TRANSITION YEAR DATA^ USED FOR SPRING WHEAT SEPARABILITY EXPERIMENT 


SEGMENT NUMBER 

A/D #1 

A/D #2 

A/D #3 

A/D #4 

A/D #5 

A/D #6 

A/D #7 

1380 

78/169 

78/196 

78/205* 

78/222* 

78/232 

78/241* 

78/249 

■1392 

78/136 

78/154 

78/190* 

78/208 

78/217* 



1457 

78/156 

78/174 

78/228* 

78/246* 

78/264* 

78/273 


1461 

78/136 

78/155* 

78/190* 

78/199 

78/209* 

78/217 

78/236 

1467 

78/136 

78/154 

78/190 

78/199* 

78/208 

78/218* 


1473 

78/116 

78/197* 

78/207* 

78/224* 

78/269 



1518 

78/116 

78/135- 

78/153* 

78/188 

78/206 

78/224* 

78/243* 

1566 

78/115 

•78/133 

78/169* 

78/196 

78/232* 



1602 

78/174* 

78/211* 

78/228* 

78/264 




1612 

78/118 

78/137 

78/155* 

78/199 

78/218 

78/236* 


1619 

78/135 

78/198* 

78/207* 

78/216* 

78/243 

78/252 

78/270 

1636 

78/135 

78/154 

78/190 

78/208* 

78/216* 

78/226 

78/243* 

1650 

78/156* 

78/191 

78/209* 

78/218 

78/228 

78/236* 

78/246 

1653 

78/119 

78/136 

78/155* 

78/191* 

78/199 

78/208* 

78/217 

1656 

78/137 

78/155* 

78/191 

78/209* 

78/218 

78/263 


1825 

78/133 

78/169* 

78/196 

78/206* 

78/224* 

78/232 

78/250 

1835 

78/134 

78/170* 

78/187 

78/196* 

78/224* 

78/232 

78/241 

1920 

78/136* 

78/199 

78/209* 

78/218* 

78/236 

78/271 



‘Note acquisition dates shown are the subset of available acquisitions which were selected 
for digital processing. 


The acquisition dates marked with an * were used to produce the accompanying blob maps. 




• Each of the 18 segments will have a packet con- 
sisting of the same material (ancillary data, maps, 
normal crop calendars, crop calendar adjustment, 
etc) used during LACIE TY operations. The excep- 
tion to this is that the spectral aids will not be 
used for crop identification (because they do not 
match the blobs) . 

• All analysis techniques (i.e., image interpretation 
techniques) will be the same as those used during 
the LACIE TY operations . 

Note; It is anticipated that the time required 
to label and complete the evaluation forms will 
be approximately 10 hours/segment. 

C.2.3 DESCRIPTION OF THE DATA BASE 

ERIM has constructed data files containing spectral variables, 
ground truth, and analyst labels on the pixel, blob, and big blob levels. 
The big blob variables are also contained in an SPSS-formatted system 
file. 

1. Spectral variables ; The four Landsat bands were transformed 
into the Tasseled-Cap variables Greenness and Brightness. All IS seg- 
ments in this study have Greenness and Brightness variables for each 
acquisition on the pixel, blob, and big blob levels. 

2. Ground truth ; Ground truth for 17 segments (all except seg- 
ment 1835) is also contained in the same files as the spectral variables 
on the pixel level, blob level, and big blob level. The ground truth 

is given in subpixels and aggregated up to the pixel and blob levels . 

3. Analyst labels ; The analyst labels of big blobs are contained 
only in the SPSS system file. For each analyst, we have: 

a. grain/other label, 

b. low-confidence label, and 

c. mixed target label. 
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4. Special variables in the SPSS file : Several variables were 

created in- the SPSS file in order to address the objectives of this 
experiment. 

a. Purity of target. 

b. Size of target. 

c. Crop category (e.g., summer crop, spring crop, etc). 

d. Acquisition history. 

e. Crop calendar (profile shift diagnostics). 

f. Crop condition (peak green). 

g. Spectral trajectory stratum of target. 

C.2.4 DESCRIPTION OF THE ANALYSIS OF THIS EXPERIMENT 

Analyst labeling ; Analyst labeling was investigated both as a 
component in a segment crop proportion estimation procedure and as 
a process to be studied for understanding. The analysis of analyst 
labeling of blobs can be found in Section C.3 of this Appendix. The 
effect of analyst labeling on grain proportion estimates can also be 
found in Section C.3. Section C.4 describes how the evaluation of 
analyst labels as a component within Procedure M will be carried out. 

The analysis of analyst labeling of field-like targets was 
performed for the most part using descriptive statistics. These des- 
criptive statistics included; 

a. contingency tables, 

b. frequency tables, 

c. two-dimensional scattergrams, and 

d. plots. 

Also, some inferential statistical procedures were used to gain insight 
into the analyst labeling process. These included: 

a. ANOVA 

b. Regression 

c. Discriminant Analysis 
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Performance of Procedure M and its components : The reduction of 

variance of the BLOB and BCLUSTER components are given for each segment 
in Section C.4 of this Appendix. Histgorams are also given to examine 
the grain/other separation obtained by the BLOB and BCLUSTER components.' 
The effect of the labeling component and sampling strategies on bias 
and variance are also studied in Sections C.3 and C.4. 

The evaluation of Procedure M's performance is not completed at 
this time, however, the measurements and methods of evaluation to be 
used are given in Section C.4. 

C.3 INVESTIGATION OF ANALYST LABELING OF FIELD-LIKE TARGETS 

C.3.1 EVALUATION OF THE EFFECTS OF TARGET LEVEL VARIABLES 

Table G-3 gives the labeling accuracy for various sizes of the 
blob interior. The labeling accuracy of both grain and non-grain 
increases as the blob size increases in the range of 1-14 pixels after 
which the increase in accuracy tends to slow down or to level off. 

Table C-4 gives the labeling performance by purity of target. 

One sees immediately the non-symmetry of the labeling errors . Those 
blobs with 0-9% grain are (incorrectly) classified grain 2.2% of the 
time, while those blobs with 90-100% grain are (incorrectly) classi- 
fied non-grain 100-70.4 = 29.6% of the time. We also see that 
46.9 + 19.5 = 66.4% of the big blobs are in these two purity classes. 

The labeling accuracy of grain within the class 90-100% grain is less 
than that within the class 80-89% grain. This phenomenon is due to 
the much lower accuracy of labeling pure oat blobs; 420 of the 1182 
grain blobs within the 90-100% grain class are oats. If these 420 
blobs are excluded, then the accuracy is about 80%. 
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TABLE C-3. ANALYST LABELING ACCURACY (PCC) AS A FUNCTION 

OF QUASI-FIELD SIZE 


Analyst 


Size 

Vote 

Green 

Red 


Blue 

Strata* 

NG 

G 

NG 

G 

NG 

G 

NG 

G 

1 

91.5 

47.5 

88.6 

48.6 ■ 

86.8 

57.3 

93.0 

34.6 

2 

94.5 

49.2 

90.4 

51.6 

89.8 

57.1 

93.9 

44.8 

3 

94.5 

66.9 

92.4 

69.0 

89.5 

71.5 

93.2 

46.5 

4-6 

93.6 

63.9 

93.8 

66.4 

87.0 

68.2 

94.2 

47.5 

7-9 

93.4 

67.1 

92.6 

69.6 

87.3 

72.5 

96.2 

47.3 

10-14 

95.2 

72.7 

93.5 

72.8 ■ 

88.4 

75.5 

96.0 

51.2 

15-19 

94.0 

71.2 

95.7 

71.7 

88.2 

76.3 

94.3 

57.3 

20-28 

96.1 

76.6 

96.7 

72.9 

91.2 

79.1 

97.0 

61.1 

29-46 

97.6 

76.8 

97.9 

76.8 

93.6 

79.3 

97.3 

58.8 

47-120 

98.8 

70.4 

98.8 

78.1 

95.9 

72.8 

98.8 

55.6 


*Number of interior pixels in blob. 


TABLE C-4. ANALYST LABELING PERFORMANCE AS A FUNCTION OF QUASI-FIELD PURITY 


Blob Composition (% small grain) 



0-9 

10-19 

20-29 

30-39 

Percent Labeled 
Grain 

2.2 

5.8 

9.6 

18.5 

Strata Size 

2691 

569 

250 

211 

Weighted Per- 

cent Labeled 
Grain 

1.6' 

4.1 

7.0 

18.8 

Strata Size 

(Percent of 
Total Pixels 

46.9 

7.1 

3.3 

3.0 


in Big Blob) 


40-49 

50-59 

60-69 

70-79 

80-89 

90-100 

27.6 

44.1 ' 

63.9 

62.9 

72.6 

70.4 

174 

195 

204 

310 

453 

1182 

28.3 

47.5 

58.4 

66.3 

75.5 

72.2 

2.6 

2.8 

3.0 

4.4 

7.4 

19.5 
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Table C~-5 gives the analyst performance on major crop types. We 
note that, as in the case of Table C-4, non-grains are labeled more 
accurately than grains. It should be noted that the classification of 
the strip and unknown category is not knox-m. 

Table C-6 gives the percentage of grain labels for each of the 
grain crops. Spring wheat was labeled as a grain much more often than 
the others. Barley was more confusing to the analyst than wheat, and 
oats were more confusing than barley. There were only 14 fields of rye 
and the majority labeled 12 of them as a non-grain. 

Table C-7 gives a breakdown of Table C-6 by segment. Oats in the 
Minnesota segments (1518 and 1825) were labeled much better than oats 
in the North Dakota segments. We also note that the very poor labeling 
of grain in segments 1656 and 1920 is due to the mislabeling of oats . 
There are wide variations from segment to segment which suggest that 
the problems encountered by the analysts also vary widely from segment 
to segment. 

Table C-8 gives the percent of non-grain blobs labeled as grain 
for several crops. Flax turned out to be the only confusing non-grain 
to Analysts Green and Red, while Analyst Blue labeled all flax blobs 
as non- grain. 

The effect of profile characteristics on analyst labeling accuracy 
was examined by comparing the Greenness path of each grain blob to an ex-, 
pected Greenness curve. A relationship would imply that analyst label- 
ing accuracy is impacted by crop condition, as reflected through its 
Greenness trajectory. The standard Greenness curve is given by the 
expression: 

P(t) = .65163t^-^^^^exp(-1.00052415t^) 
where F = Greenness - 25 

< 

t = day of year - 125. 
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TABLE C-5. PERCENTAGE OF MAJOR CROP QUASI^FIELDS LABELED AS GRAIN* 



Percent 



Vote 

Analysts 
Green Red 

Blue 

Percent 

Purity 

of Pixels 
Considered 

•No. 

Segments 

Spring Crops 

71.6 

73.1 

75.3 

55.5 

93.3 

48.2 

17 

Summer Crops 

3.2 

3.3 

9.0 

1.8 

94.6 

41.6 

16 

Pasture and 
Grass 

0.8' 

1.0 

3.2 

1.5 

93.9 

74.3 

15 

Fallow 

3.2 

4.1 

9.3 

1.7 

91.8 

39.8 

16 

Miscellaneous 

1.2 

4.7 

1.6 

1.6 

94.3 

55.7 

15 

Unknown and 
Strip 

23.1 

11.5 

46.2 

46.2 

90.9 

55.9 

9 


TABLE C 

-6. 

PERCENTAGE 

OF GRAIN 

CROP 

QUASI-FIELDS 

LABELED AS GRAIN* 



Vote 

Green 

Red 

Blue 

Percent 

Purity 

No. 

Quasi-Fields 

Labeled 

No. 

Segments 

Spring Wheat 

82.9 

84.6 

84.9 

70.2 

91.1 

800 

17 

Barley 

64.3 

68.2 

75.3 

38.3 

92.6 

154 

12 

Oats 

38.5 

39.4 

40.0 

28.0 

96.4 

267 

17 

Rye 

15.4 

21.4 

23,1 

7.1 

93.0 

14 

5 


Quasi-fields at least 80% 


pure . 
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TABLE C-7. AI LABELING ACCURACY (PCC) OF MAJOR GRAINS BY VOTE^’^ 


Segment 

1380 

1392 

1457 

1461 

1467 

■1473 

1518 

1566 

1602 

1612 

1619 

1636 

1650 

1653 

1656 

1825 

1920 


Grain 

96.0 
90.5 

64.9 

81.9 

55.3 

89.9 

76.9 

90.4 

88.3 

47.4 
71.2 

59.4 
76.9 

71.1 
9.8 

82.1 
36.7 


Wheat 

94.4 

90.7 

81.8 

86.2 

50.0 

97.7 

72.7 

100.0 

91.8 

84.6 

92.6 

61.4 

66.7 

68.0 

90.9 

71.4 


Oats 

. 31.7 

30.8 

92.9 

16.1 

8.3 

86.4 

20.0 


Barley 


91.7 

78.0 

80.0 


18.3 


64.7 


^Quasi-fields at least 80% pure. 

2 

At least 10 quasi-field required. 
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TABLE C-8. PERCENT OF NON-GRAIN FIELDS LABELED AS GRAIN^ 



Vote 

Green 

Red 

Blue 

Percent 

Purity 

Percent 

Quasi-Fields 

No. 

Segment 

Alfalfa 

2.4 

2.4 

4.9 

4.8 

91.8 

42 

11 

Corn 

2.9 

2.2 

4.4 

1.5 

90.4 

137 

8 

Sunflower 

0.0 

0.0 

16.7 

0.0 

93.5 

■ 6 

4 

Sunflower 

2.7 

3.2 

14.7 

0.5 

93.7 

187 

10 

Soybeans 

1.7 

1.7 

1.7 

2.3 

94.8 

178 

4 

Sugar 

Beets 

0.0 

0.0 

0.0 

0.0 

93.4 

16 

3 

Flax 

21.4 

21.4 

44.8 

0.0 

91.3 

29 

9 

Potatoes 

0.0 

0.0 

0.0 

0.0 

92.4 

21 

1 

Pasture 

0.8 

0.8 

1.0 

1.0 

93.4 

393 

14 

^Quasi-fields are at 

least 

80% pure 

within 

the assigned 

class . 
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The model for the Greenness of blob i on day t .+125 is 

2 


G. (t.) = X.F(t.-T.) + 25 
1 J 1 J 1 

The parameters X and t were estimated for each blob. 

The estimate of peak greeness for blob i is 

Xil |f| |_^ + 25 = + 25 = X^(34.32) + 25 


Figure C-3 gives a histogram of labeling accuracy vs. blob shift. 

The intervals were chosen in' such a way that the numbers of blobs per 
interval were approximately equal. The grain blobs whose shift was be- 
tween -35 and -12 have an 80% chance of being correctly classified as 
grain. If the pure grain blobs with low peak Greenness values' are ex- 
cluded, then the labeling accuracy within the interval [-12,100] is 
about 80%. ' Those grain blobs whose profile differs by more than 35 days 
from normal have a much lower probability of being classified as grain. 
Much of the variation in the values of the blob shifts was explained by 
variations in the segment mean of these shifts. Figure C-4 gives a 
histogram of labeling accuracy vs. deviation of blob shift from the 
segment mean. The intervals were again chosen so that each had approxi- 
mately an equal number of blobs. Those grain, blobs which differed from 
the segment mean shift by less than 10 days were classified as grain’ 
more than 80% of the time. These two figures indicate a tendency to label 
according to an expected spectral pattern, with accuracy diminishing as 
the pattern shifts, due to variations in planting, either early or late. 

Figure C-5 gives the histogram of grain labeling accuracy vs. 
various levels of peak Greenness. Labeling accuracy is under 50% for 
those pure grain blobs with peak Greenness values under 46. This 
implies that grain fields that are either stressed or of low canopy 
cover, tend to be identified as non-grain. It was noted that a high 
peak Greenness offset the tendancy to mislabel in the presence of 
late shifts. 
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Vote Accuracy {%) 




: - Average Shift in Each Segment 

;y ^ 80 % 
ided 

>taj Segment 1619 Excluded 
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C.3.2 EVALUATION OF JOINT ANALYST PERFORMANCE 

Analyst consistency : Table C-9 gives the percent of decisions on 
which the pairs of analysts and all analysts agreed for each segment 
The highest consistency shown by all three analysts occurred in segment 
1380 where the analyst agreed on 92.7% of their labels, while the 
lowest consistency occurred in segment 1467 with only 55.7% agreement. 

. Table C-10 gives the percentage that pairs of analysts correctly 
agreed with each other for several crop types. For example. Analysts 
Green and Red both correctly labeled grain targets as grain 61.2% of 
the time and Analysts Red and Blue correctly labeled summer crops as 
non-grain 92.1% of the time. As noted earlier, the analysts do much 
better with non-grain than grains. 

Table C-11 gives, for several crops, the percentage in which none 
were correct (0 of 3) , one was correct (1 of 3) , two were correct (2 
of 3), and all were correct (3 of 3). For example wheat was mislabeled 
as non-grain by all analysts (0 of 3 were correct) 7.7% of the time, 
wheat was labeled as grain by only one analyst (1 of .3 correct) 8.9% 
of the time, wheat was labeled as grain by two of the analysts (2 of 
3 correct) 18.3% of the time, and all three analysts (3 of 3 correct) 
65.1% of the time. It is interesting that oats are missed by all three 
analysts more than half of the time. 

The columns of Table C-12 were constructed from the columns of 
Table C-11 using the following relations: 

t ^ P(2of3 correct) 

P (correct label 2 aoalyat agreed) - p(i correct)!-? (2 of 3 correct) 

and 

P (correct label 3 atalyat agreed) - p(o 3 ■ c - cLect)+P°rof 3 correct) 
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TABLE C-9. ANALYST CONSISTENCY - PERCENT OF DECISIONS IN AGREEMENT 

Red 


Segment 

Green 

Red 

Green 

Blue' 

Red 

Blue 

Green 

Blue 

1380 

94.9 

■ 96.4 

94.2 

92.7 

1392 

82.6 

73.9 

66.7 

61.6 

1457 

90.6 . 

83.4 

■83.5' 

79.1 

1461 

88.0 

83.6 

80.6 

76.3 

1467 

79.4 

60.5 

71.5 

55.7 

1473 

91.2 

84.3 

83.3 

79.5 

1518 

91.7 

82.2' 

80.5 

77.0 

1566 

91.4 

84.8 

V 

86.2 

81.0 

1602 

• 93.0 

91.8 

91.1 

88.0 

1612 

92.0 

90.2 

90.5 

86.4 

1619 

88.5 

85.9 ' 

'87.6 

80.0 

1636 . 

86.7 

■ 85.8 

84.4 

78.5 

1650 

80.8 

79.9 

81.3 

71.2 

1653 

73.9 

85.2 

76.5 

68.0 

1656 

91; 0 

92.8 

91.9 

87.9 

1825 

90.3 

89,0 

84.4 

81.9 

1835 

92.2 

92.9 

92.0 

88.8 

1920 

91.9 

89.0 

89.8 

85.2 
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TABLE C-10. ANALYST CONSISTENCY - PERCENT OE DECISIONS IN CORRECT AGRjEEMENT 


Analyst Pair 

Non- 

Grain 

> 50% 
Grain 

> 80% 
•Grain 

Wheat 

Oats 

Barley 

Summer 

Crop 

■Pasture 
& Grass 

Green 

Red 

87.0 

61.2 

77.4 

79.6 

33.6 

60.0 

99.7 

99.1 

Green 

Blue 

91.1 

44.2 

49.8 

66.1 

23.0 . 

33.1 

97.8 

99.1 

Red 

Blue 

87.9 

46.8 

48.2 

67.4 

24.3 

37.3 

92.1 

97.6 

Vote 

94.7 

66.3 

71.0 

82.9 

38.5 

64 . 3 

.96.8 

99.2 


TABLE C-11. 

ANALYST 

/ 

CONSISTENCY - PERCENT OF 

DECISIONS 

WITH INDICATED AGREEMENT 

No. Analysts 
Correct 

Non- 

Grain 

> 50% 
Grain 

> 80% 
Grain 

Wheat 

Oats 

Barley 

Summer 

Crop 

Pasture 
& Grass 

0 of 3 

1.7 • 

19.8 

17.1 . 

7.7 

52.7 

14.9 

1.2 

0.1 

1 of 3 

3.5 

13.6 

11.6 

8.9 

9.2 

19.6 

1.9 

0.6 

2 of 3 

9.1 

23.5 

22.5 

18.3 

16.9 

32.4 

6.8 

3.9 

3 of 3 

85,. 7 

43.1 

48.8 

65.1 

21.2 

33.1 

90.2 

95.3 





TABLE C-12. PROBABILITY OF CORRECT LABELING AS A FUNCTION OF AGREEMENT 


LO 


No. Analysts 

Non- 

> 50% 

> 80% 

In Agreement 

Grain 

Grain 

Grain 

2 ' 

72.2 

63.3 

66.0 

3 

98.1 

68.5 

74.1 


Summer Pasture 


■Wheat 

Oats 

Barley 

Crop 

& Grass 

67.3 

64.8 

62.3 

78.2 

86.7 ' 

89.4 

28.7 

69.0 

98.7 

99.6 



2p! 

In every case except oats, the probability that a correct label is 
obtained when all three analyst agreed is higher- -than when only two 
agreed. The accuracy of the labeling of oats was very poor in the 
North Dakota segments and very good in the Minnesota. 

Each analyst labeled his first three segments twice. The second 
labeling occurred after all segments were labeled. Table C-13 gives 
the analyst consistency for the segments each analyst repeated. Analyse 
Green made by far the fewest changes while Red made the most changes, 
and also had the only significant increase in accuracy. 

Comparison of average and ma.jority vote labeling to individual 
analysts : The three grain/non-grain labels were combined to form a 

majority vote label. The' majority vote label was defined to be the 
label given by the majority of the analysts. 

Table C-14 gives labeling accuracies for each analyst and vote 
for each segment for grain and non— grain. Analyst Red had the highest 
labeling accuracy for grains (71.1%), Analyst Green's grain accuracy 
(68.6%) was significantly higher than Analyst Blue's grain accuracy 
(51.0%). Analyst Blue's non-grain accuracy (95.3%) was the highest 
followed closely by Analyst Green's (93.9%) and Analyst Red's (89.8%). 

Table C-15 gives the ground truth statistics for each segment, 
normalized for bad ground truth. 

Table C-16 gives the segment grain proportion estimates for each 
analyst, vote, average, and binary ground truth (blobs are given grain/ 
other label depending on ground truth) . 

Table C-17 gives the bias of the segment proportion estimate, 
standard deviation and root mean square (RMS) of the errors for each 
analyst, vote and average. We note the following orderings: 

[bias (red) j< [bias (green) | < | bias (vote) | < | bias (ave. ) [ < | bias (blue)], 

SD (blue)<SD (ave,)<SD (vote)<SD (red)<SD (green), and 
RMSE (ave)<RMS (red)<RMS (vote)<RMS (green)<RMS (blue). 
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TABLE C-13. ANALYST CONSISTENCY IN LABELING THE SAME SEGMENTS 
AT THE BEGINNING AND END OF THE EXPERIMENTS 



Analyst/ 

Initial Accuracy 

Rework Accuracy 

Total Changes 

to. SeRraents 

_G 

NG 

G 

NG 

Total 

Correct 

Green (3) 

62.9 

96.4 

60.0 

95.7 

74 

29 

Red (3) 

66.7 

92.1 

78.9 

92.0 

175 

116 

Blue (3) 

51.2 

91.7 

52.8 

91.6 

141 

67 
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TABLE C-14. LABELING ACCURACIES FOR GRAIN AND NON-GRAIN 


Vote Green Red Blue 


Segment 

NG 

G 

NG 

G 

NG 

G 

NG 

G 

All 

94.9 

66.8 

93.9 

68.6 

89.8 

71.1 

95.3 

51.0 

1392 

90.1 

82.7 

87.2 

85.8 

75.6 

91.3 

95.4 

42.4 

1457 

97.3 

62.6 

94.5 

67.6 

96.7 

67.4 

97.3 

43.8 

1461 

90.9 

79.2 

90.9 

79.2 

88.0 

86.0 

97.9 

57.5 

1467 

80.4 

54.5 

75.8 

67.6 

74.5 

57.7 

90.8 

25.9 

1473 

98.7 

88.0 

93.5 

86.9 

98.7 

96.7 

98.0 

69.4 

1602 

95.1 

74.6 

96.6 

73.1 

93.4 

79.7 

93.1 

83.6 

1612 

94.5 

50.6 

95.8 . 

48.1 

92.8 

51.9 

91.6 

38.3 

1619 

98.2 

65.7 

97.0 

70.3 

94.4 

68.6 

97.5 

54.9 

1636 

92.7 

57.1 

91.6 

62.0 

86.8 

58.5 

94.8 

58.0 

1650 

92.4 

59.2 

93.4 

55.4 

87.2 

61.1 

91.3 

68.1 

1653 

95.2 

63.6 

98.4 

33.8 

78.9 

80.6 

91.5 

61.9 

1656 

97.5 

10.1 

98.3 

17.4 

94.7 

2.9 

95.5 

17.4 

1920 

98.4 

41.4 

97.8 

37,8 

95.1 

45.5 

•96.7 

30.6 

1380* 

94.1 

100.0 

92.3 

88.5 

95.0 

92.3 

93.8 

96.2 

1518* 

99.1 

68.2 

97.3 

. 69.7 

95.9 

74.4 ■ 

99.6 

34.4 

1566* 

97.4 

88.3 

93.9 

90.1 

93.3 

,91. o’ 

99.0 

69.8 

A. 

1825" ■ 

97.5 

79.6 

95.9 

81.0 

91.0 

83.8 

98,3 

64.0 


if 

Minnesota Segments 
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TABLE C-15 . SEGMENT GROUND TRUTH STATISTICS 
(Grain Proportions, Big Blobs Only) 


Segment 

Percent 

Grain 

Percent 

Wheat 

Percent 

Barley 

Percent 

Oats 

Percent 

Rye 

Percent 

Unknown 

1380* 

6.19 

-4.91 

.005 

1.28 

0.0 

2.9 

1392 

33.0 

26.4 

5.4 

1.1 

0,0 

1.5 

1457 

50.5 

37,0 

1.2 

12.3- 

0.0 

1.0 

1461 

40.2 

31.0 

4.6 

3.4 

1.3 

6,0 

1467 

57.1 

35.8 

10.7 

10.6 

0.0 

3.0 

1473 

49.7 

31.8 

17.0 , 

0.6 

0.3 

5.3 

1518* 

34.86 

24.44 

2.80 

7.47 

0.16 

1.74 

1566* 

35.28 

22.78 

6.35 

5184 

0.32 

4.36 

1602 

30.4 

26.6 

1.1 

1.9 

•0-.9 

0.6 

1612 

26.6 

11.1 

0.3 

15.0 

0.2 

0.5 

1619 

47.7 

35.8 

11.5 

0.4 . ■ 

0,0 

1.3 

1636 

42.5 

35:7 

2.1 

3.7 

0.9 

5.8 

1650 

29.5 

23,3 

1.3 

4.6 

0.2 

5.7 

1653 

19.0 

14.8 

0.4 

3.7 

0.1 

3.3 

1656 

15.8 

2.7 

0.4 

12.7 

0.1 

0.9 

1825* 

33.12 

17.48 

5.44 

9.73 

0.5 

6.5 

1920 

29.8 

14.9 

0.5 

14.3 

0.1 

0,4 

Average 

35.6 

24.6 

4.1 

6.5 

.25 

2.5 


* 

Minnesota Segments 
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TABLE C-16. SEGMENT PROPORTION ESTIMATES* 



Binary 

Ground 

Truth 

Proportion 

Estimates 

of Grain 

(Aggregated Blob 

Segment 

(% Grain) 

Green 

Red 

Blue 

Vote 

Average 

1392 

.37 

.395 

.483 

.209 

.374 

.362 

1457 

.53 

.375 

.364 

.242 

.343 

.327 

1461 

.45 

.475 

.528 

.338 

.459 

.447 

1467 

.25 

.576 

.474 

.211 

.443 

.421 

1473 

.56 

.533 

.564 

. 444 

.530 

.514 

1602 

.29 

.254 

.290 

.267 

.263 

.270 

1612 

.21 

.110 

.132 

.117 

.112 

.120 

1619 

.57 

.461 

. 462 

.498 

.436 

.474 

1636 

.48 

.347 

,.358 

.335 

.324 

.347 

1650 

.22 

.182 

.214 

.211 

.196 

.202 

1653 

.16 

.073 

.265 

.146 

.141 

.161 

1656 

.15 

.032 

.036 

.044 

.024 

.037 

1920 

.29 

.144 

.195 

.115 

.155 

.151 

1380** 

.05 

.098 

.108 

.083 

.102 

.096 

1518** 

.36 

-274 

.312 

.114 

.267 

.233 

1566** 

.36 

.363 

.369 

.267 

,334 

.333 

1825** 

.40 

.357 

.383 

.288 

.345 

.343 


Blobs which have at most 10% bad ground truth. 
Minnesota Segments 
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TABLE C-17. 

BIAS, STANDARD 

DEVIATION AND 

ROOT MEAN 

SQUARE OF ERRORS 

Source of 
Labels 

Bias 

(F-P) 

Standard 
Deviation 
of P-P 

EMSE 

V(P-P)^ 

Performance 

Ordering, 

Analyst 

Green 

-.038 

.110 

.116 

• 4 

Analyst 

Red 

-.010 

.099 

.100 

2 

Analyst 

Blue 

-.104 

.081 

.132 

5 

Vote 

-.050 

.089 

.102 

3 

Average 

-.051 

.084 

.098 

1 
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The lower variance term of analyst average offsets the lower bias 
term of Red to give better overall results (EMSE) . In terms of perform- 
ance, analyst average was best, closely followed by Analyst Red, vote 
and Green. 

Simple linear regression was used to predict ground truth grain 
given the labels from each analyst, vote and average. The results are 
plotted in Figures C-6 through C-10. The regression lines are all 
plotted on Figure C-11. A multiple regression was used to predict 
ground truth grain given the proportion estimates from each analyst. 

The results are given in Table C-18. Analyst Blue’s proportion estimate 
was best in the sense that the correlation coefficient was higher than 
those of Analysts Red and Green, and the correlation coefficient of 
Analyst Blue was almost as high as the multiple regression coefficient 
using all three analysts. The regression coefficient for Analyst Blue 
was the only one to be significantly different from zero (5% level) . 

The simple correlation coefficient for Analyst Blue is higher than those 
for average and vote. This is very interesting since the EMSE perform- 
ance of Blue is the lowest. RMSE(P) is a measure of the distance between 
the points (P^, P^) and the 45° line through the origin. The proportion 
estimates of the other sources of labels fit this line better than the 
proportion estimate of Blue. Simple linear regressions will fit the best 
line through the data. Thus, we conclude that Blue’s proportion estimate 
has the best linear trend, but the y-intercept and the difference of the 
slope from 1 causes Blue’s EMSE to be higher than the EMSE of the other 
sources of labels. 

C.3.3 INVESTIGATION OF RELATIONSHIP BETWEEN ANALYST PERFORMANCE 
AND LABELS OF "LOW CONFIDENCE" AND "MIXED TARGET" 

The snalyst, in addition to labeling a blob grain/non-grain, 
could flag a blob as "low confidence in label" and/or "mixed targets." 

The analysts used these labels very rarely. There were very few blobs 
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FIGURE C-6. ANALYST GREEN'S PROPORTION ESTIMATE 
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Ground Truth Proportion of Grain 
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Proportion Estimate for Big Blobs 
FIGURE C-9. PROPORTION ESTIMATE USING VOTE 
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TABLE C-18. HEGRESSION COEFFICIENTS FOR ANALYST-BASED 
PROPORTION ESTIMATES FOR BIG BLOBS 


Correlation 


Estimate 

Y-Intercept 

Slope 

Coefficient 

Green 

0.132 

0.68 

0.76 

Red 

0.084 

0.77 

0.78 

Blue 

0.083 

1.12 

0.84 

Average 

0.099 

0.86 

0.79 

Vote 

0.091 

0.86 

0.81 


Multiple Regression of Ground Truth Grain on Analyst Labels 




ANOVA Table 


Source 

Sum of 
Squares 

ii 

Mean Square 

F Significance 

Regression 

.26873 

3 

.089576 

11.384 .0006 

Error 

.10229 

13 

,0078688 


Total 

.37102 

16 



Variable 

Coefficients 

STD -Error 

Significance 

Constant 


.067471 ■ 

.05115 

.2426 

Green 


.12652 

.38938 

.7504 

Red 


.14943 

.46177 

.7514 

Blue 


.80616 

.34471 

.036 
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that received these labels by more than one analyst. However, the 
analysts were very consistent in their proportion of flags within 
the grain purity classes: 0-20%, 20-40%, 40-60%, 60-80%, and 

80-100%. When one or more analysts gave a "low confidence" label 
to a blob, a new label "doubtful" was given to that blob. Table C-19 
gives the doubtful labels broken down by grain purity levels and label ' 
by vote.- The proportion of doubtful labels increased as the grain 
increased. This result is consistent with the results given earlier 
that non-grains were labeled more accurately than grain. The ratio 
of doubtful labels to not doubtful labels of the grain labels de- 
creased as percent grain increased. Thus, the analysts were much 
less confident of their grain labels given the non-grain targets. 

Blobs which received one or more mixed labels were tabulated 
and are given in Table C-20. The percent of mixed labels increased 
as percent grain increased from 0-50% and then generally tended to 
decrease from 60-100%. This is what should be expected; that is, 
as the labeling targets become less pure, the percentage of mixed 
labels should increase. 

The trend is much less clear in those blobs with grain purity 
50-80%. Also, the percentage of mixed labels were much higher for 
pure grain blobs than for pure non-grain blobs. This seems to indicate 
that pure grain fields behave in a more confusing fashion than pure 
non-grain fields. Distributions, of grain purity are given in Section 
C.4 of this Appendix. 
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TABLE C-19. ANALYST DOUBTFUL LABEL PERFORMANCE 


Analyst Decision 


Purity as 
a Grain 

Percent of 
Doubtful 
Labels 


Doubtful 

Not 

Doubtful 

Fraction 

Labeled Grain 


Grain Non-Grain 

Grain 

Non-Grain 

Doubtful 

Not Doubtful 

Ratio 

0-20% 

1.60% 

13 

36 

24 

3037 

.27 

.02 

I3.5 

21-40% 

3.60% 

6 

10 

57 

388 

.37 

.13 

2.8 

41-60% 

9.01% 

17 

14 

117 

227 

.54 

.34 

1.6 

61-80% 

10.00%. 

34 

13 

274 

196 

.72 

.58 

1.2 

81-100% 

16,54% 

185 

47. 

976 

427 

.79 

■ .70 

1.1 
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TABLE C-20. KELATIONSHIP BETWEEN PERCENT 
GRAIN AND "MIXED" LABELS 


Percent Grain 

0 - 10 %- 

10 - 20 % 

20-30% 

30-40% 

40-50% 

50-60% 

60-70% 

■ 70-80% 

80-90% 
90-100% 


Percent With One or More 
"Mixed Target" Flag 

1.1 

3.8 

6.8 
11.3 
20.6 
16.2 
17.6 
13.0 
13.2 

7.4 
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C.4 TEST AND EVALUATION OF PROCEDURE M 

C.4.1 ANALYST COMMENTS ON BLOB LABELING 

Each analyst was asked to complete a segment comment form after 
labeling each segment and a final comment form after labeling all 
segments. The comment forms are’ given in Section C.2 of this 
Appendix, In addition, the analysts generated a report [32] on 
analyst labeling of blobs. • 

General comments on blob labeling ; From [32] we obtain a list 
of strong points, problem areas, and recommendations. 

Strong Points 

• Blobs are easier to label than the dots used in Procedure 1 
(P-1) . A blob represents a field center and does not con- 
tain border or edge dots as does a P-1 dot. 

• BLOBS represent field centers rather well. 

• If ERIM reduces the number of blobs to approximately 100, 
as currently planned, labeling of blobs should be as effi- 
cient or perhaps more efficient than the labeling of dots 
in- Procedure 1. 

Problem Areas 

• The blobbing technique, as' currently implemented, produces 
too many blobs for labeling (400-600) . 

• Small or stripped fields do not blob or cluster very well. 

• Acquisitions selected for blobbing by ERIM' for the labeling 
test were not always optimum. 
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• Blobs were frequently disjointed which resulted in labeling- 
difficulties. 

• Small blobs containing only one to three pixels are difficult 
to label. 

Recommendations 

• Acquisition selection for blobbing should be based on multi- 
temporal/spectral information as well as spatial Information 
(spatial data was used as the primary selection criteria for 
the test).. 

• Research should be conducted into the small fields problem. 

• Reduce the use of single— pixel blobs whenever possible. 

• Modify the line printer blob map and Production Film Converter 
(PFC) blob overlay. The current format of these two products 
is not conductive to efficient analyst labeling. 

The 1®^ comment under Strong Points and the l^t under Problem Areas 
are comments about the exhaustive labeling of all big blobs used in this 
experiment compared to the labeling of a sample (of size 100) of big 
blobs in an operational- mode of Procedure M. The exhaustive labeling 
allows us to compute the variances of many different sampling schemes. 

The 2^*^ comment under Problem Areas^ should be viewed as a comment 
about big blobs not formed from small or stripped fields. We have 2 
found many Blobs representing small and stripped fields which would 
have been good labeling targets but were not Included because they did 
not contain any interior pixels. We are currently conducting, research 
on defining the attributes of' those little blobs which would make good . 
labeling targets. The last comment under Problem Areas is almost the 
opposite of the 2^^^, i.e., blobs with only one to three interior pixels 
are difficult to label. 
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The 4^^ coniment under Problem Areas has motivated the development 
of a new blobbing procedure called ' SUPERB which should eliminate most 
of the disconnected blobs. 


The 3^*^ comment under Problem Areas would be at least partially 
addressed by having an analyst involved in the selection of acquisitions 
for blobbing. 

The 1®*^ comment under Recommendations would improve the relation- 
ship between fields and blobs. Some of the acquisitions used .in blobbing 
were misregistered by up to three pixels which possibly caused some blobs 
to be disconnected and caused some to be smaller than what otherwise 
would be the case. 

The 2 ^^ and comments under Recommendations are very closely 
related. The more small fields that are sample targets the more blobs 
with only one interior pixel will be sampled and labeled. On the other 
hand, bias is introduced by any restriction on sampling. Much more 
research must be done. in this area. 

Summarized analyst comments for each segment ; Table C-21 summarizes 
the analysts responses to parts 5b and 6c of the segment comment form 
Table C~22 summarizes the analysts' responses to parts 6a and 6b. 

Analysts A and C both flagged Segments 1650 and 1920 as having blobs 
with impure interiors and as. having blobs whose patterns do not match 
the field patterns.* Table C-23 sunnnarizes the analysts' responses 
to part 6d. Segment 1650 had many small strip fields while Segment 
1920 had poor acquisition history. Table C-24 summarizes the analysts' 
responses to part 7. Table C-25 summarizes the analysts' responses to 
part 8: "Did. you have to change your procedure?" We note that few 

changes were made and these were minor. Table C-25 also gives analyst 
remarks on moisture and elevation. 


The arbitrary association of letter symbols with color symbols for 
analysts in these tables was established by coin flips. 


93 




TABLE C-21. SUMMARY OF ANALYST COMMENTS ON PARTS 5b AND 6c 
OF THE SEGMENT COMMENT FORtI 


(Part 5b) 

Were Landsat acquisi- 
tions deficient? 


(Part 6c) 

Was choice of blob acqui- 
sitions optimal? 


Segment 


Analyst A 


Analyst A 


1380 

1392 

1457 

1461 

1467 

1473 

1518 

1566 

1602 

1612 

1619 

1636 

1650 

1653 

1656 

1825 

1835 

1920 


N 

Y need late August, 
late June 

Y missing acq. 
during July 

Y need late June, 
early Sept. 

Y need late Aug. 
late June 

Y lack acqs. in 
May/ June 

N->- some clouds 

Fair more July, Aug, Sept. 

Y need pre-plant, 
early Sept. acq. 

Y need late June 

Y lacks June, early July 

Y missing late June 

Y lacks late June' acq. 

N-J- late Aug. would have 

helped 

Y poor data in late June, 
early July; poor acq. 
history 

N 

N 

Y no June, early July acq. 


N 8205 has poor 

field definition 

N(?) inclusion of 8136 
might help remove 
mixed blobs 

Y 

Fair 

N 8218 misregistered 

May, June dates would 
have helped stream areas 

Y 

Y- 

Y 

Y 

Y few other choices 
. Y 

Y 

OK 

Y 

OK might have used, 8218 

Y 

Y 

Y few other choices 
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TABLE C-21. SUMMARY OF ANALYST COmENTS ON PARTS 5b AND 6c 
OF THE, SEGMENT COMMENT FORM (Cont.) 



(Part 5b) 

Were Landsat acquisi- 
tions, deficient? 

(Part 6c) 

Was choice of blob acqui- 
sitions optimal? 

Segnent 

Analyst B 

Analyst 


1380 

N 


N 

should use 
8169 (vice) 

8222 (harv/ripe) 

1392 

Y 

lack of harv, post- 
harv acq. caused SG/ 
summer crop confusion 

N 

should've also used 
8136, 8154 

1457 

Y 

need July acq. 

Ade- 

quate 


1461 

N 


N 

should've used 
8137, 8190, 8217, 8236 

1467 

N 

need late Aug. , 
early Sept . 

N 

misregistration; should 
have used 8155,8199,821: 

1473 

Y 

needed -May, late Aug. 

Y 


1518 

N 


N 

needed May 15 acq. 

1566 

B 

needed late Aug . 
early Sept. 

N 

all 4 acq. should've 
been used 

1602 

Y 

needed May acq. 

Ade- 

quate 


1612 

Y 

need Sept. acq. 

N 

2 acq. not enough; 
should've used 8137, 
8155, 8199, 8236 

1619 

Y 

need June acq. 

N 

need all four acq. 

1636 

Y 

1977 Fall date would’ve 
helped 

N 

acqs. all too far into 
growing season 

1650 

N 


• Y 


1653 

Y 

lack ripe, harvest acq. 

Y 

adequate 

1656 

N 


Y 


1825 

Y 

need 1 acq. after 
Sept. 7 

N 

8133, 8169,. 8223, 8250 
would've been better 

1835 

N 


Y 


1920 

Y 

early Sept. acq. needed 

Y 

adequate 
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TABLE C-21. SUMMARY OF ANALYST COMMENTS ON PARTS 5b AND 6c 
OF THE SEGMENT COMMENT FORM (Cont.) 



(Part 

Were 

tions 

5b) 

Landsat acquisi- 
def icient? 

(Part 6c) 

Was choice of blob acqui- 
sitions optimal? 

Segment 

Analyst C 

Analys t C 


1380 



Y 


1392 

N 


N 

needed 8154 

1457 

— 


N 

only 3 dates; 8174 
may have helped 

1461 

— 


N 

8236 might have 
better defined blobs 

1467 

- 


N 

needed early date 

1473 

Y 

haze, clouds, 
cloud shadow 

- 


1518 

Y 

haze, clouds, 
cloud shadow 

N 

needed 8206 

1566 

- 


N 

needed 4 or more 

1602 

N 


N 

should've used 4 

1612 

N 

haze, clouds, 
need later dates 

Basically 

Yes 

needed 8199 

1619 

Y 

clouds, haze on 8198 

N 

could've used better 
days 

1636 

N 


N 

needed early date 

1650 

Y 

need good pre- 
planting acq. 

N 

should've used later 
harvest acq. 

1653 

Y 

no satisfactory' pre- 
plant or harvest 

Y 


1656 

- 


N 

use 1,2 more acq. 

1825 

- 


Y 


1835 

N 


N 

need more spread 

1920 

- 


- 
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TABLE C-22. SUMMARY OF ANALYST COMMENTS ON PARTS 6a AND 6b 
OF THE SEGMENT COMMENT FORM 


(Part 6a) 

Do blob interiors seem 
pure? 


(Part 6b) 

Do blob patterns match 
field pattern? 


Segment Analyst A 


Analyst A 


1380 

Y 

1 exception 

Y 

but much smaller 
than fields 

1392 

Fair 

6 mixed 

Fair 

patterns on imagery 
poorly defined 

1457 

Y 

mostly 

Y 

mostly 

1461 

Y 

3 exceptions 

Y 

mostly 

1467 

N 

many areas contain 
more than one field 

Fair/Poor 


1473 

Y 


Y • 


1518 

Y • 

but one 

Y 


1566 

Y 


Y 

fairly well 

1602 

Y 

except 3 

Y 


1612 

Y 

except 3 in 
strip area 

Y 

fairly well except 
strip area 

1619 

Y 

6 exceptions 

• Y 


1636 

Mostly 


Y 


1650 

N 

many mixed 

N 


1653 

Mostly 

3,4 exceptions 

Fair 


1656 

Y 


Some 

patterns poorly 
defined on imagery 

1825- 

Y 

3 exceptions 

Y 

mostly 

1835 

Y 

mostly 

- 

difficult to tell 

1920 

N 

many in strip areas 

N 

field definition 


poor on imagery 
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TABLE 0 ,- 11 . SUMMARY OF’ ANALYST COMMENTS ON PARTS 6a AND 6b 
OF THE SEGMENT COMMENT FORM (Cent.) 


(Part 6a) 

Do blob interiors seem 
pure? 


(Part 6b) 

Do blob patterns match 
field pattern? 


Segment ' Analyst B Analyst B 


1380 

Y 

generally 

Y 

generally 

1392 

Y 

generally • 

Y 

generally 

1457 

Y 

generally 

Y 

generally 

1461 

Y 

generally 

Y- 


1467 

Y 

generally ; 
many exceptions 

Y 

generally 

1473 

Y 


Y 


1518 

-Y 

generally 

Y 

generally 

1566 

Y 

generally 

Y 

generally 

1602 

■ Y 

some exceptions 

Y 

generally 

1612 

Y ■ 

generally 

Y 

generally 

1619 ■ 

Y 

generally 

Y 

generally 

1636 

Y 

in most cases’ 

Y 

mostly 

1650 

Y - 

- generally 

Y 

generally 

1653 

Y 

some exceptions 

Y 

generally 

1656 

Y 

generally 

Y 

generally 

1825 

Y 

generally 

Y 

generally 

1835 

Y 


Y 


1920 

Y 

generally 

Y 

generally 
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TABLE 

C-22. 

SUMMARY OF ANALYST COMMENTS ON PARTS 6a AND 6b 
OF THE SEGMENT COMMENT FORM (Cont.) 


(Part 6a) 

Do blob interiors seem 
pure? 

(Part 6b) 

Do blob patterns match 
field pattern? 

Segment 

Analyst C 

Analyst 


1380 

Y 

overall 

Y 


1392 

Y 

overall 

Y 

overall 

1457 

Y 

overall , 

Y 

overall 

1461 

Y 


Y 

overall 

- 1467 

Y 

overall 

Y 

overall 

1473 

Y 


Y 

except a few 
fields 

1518 

Y 


Y 


1566 

Y 

overall 

Y 

as far as could 
be told 

1602 

Y 


Y 

generally 

1612 

Y 

some problem in 
strip area 

Y/N 

strip area caused 
problems 

1619 

Y 


Y 

overall 

1636 

Y 


Y 

■ mostly 

1650 

N 


N ■ 


1653 

Y 


Y 


1656 

N -i- 

for spring grains 

Y/N 

more for aon-SG 
than SG 

1825 

Y 

generally 

Y 

where patterns were 
distinguishable 

1835 





1920 

N 


N 

somewhat, overall poor 
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TABLE C-23. SUMMARY OF ANALYST COMMENTS ON 


S egment Analyst Other Comments Re Blob Patterns ■ 

A - ^ 

1380 B None 

Whether a blob is developed appears to depend 
on field width. 

A 

1392 

’ B None 

C 

A 


1457 


1461 


1467 


1473 


E None 

C 

A 

Certain areas screened out; wet spots 8155; 
(see analyst form). 

C - ' 


A 

B Several areas were screened out (many SG fields) . 

More mixed blobs than previously encountered; 

^ many small. 

A Nice blob sizes; easy to work with. 

Worked like a charm! Blobs pure and umambiguous , 
helped in fine discrimination. 

Blob match to field depends on size; smaller 
more mixed. 


C 


6d OF THE SEGMENT COMMENT FORM 


Other Comroenti 


SG is planted relatively early ii 
harvested by third week in Augt 
Dots still misleading. 


Because of wide differences in p! 
pretation of late SG from summc 
Appear to have some SG startlnj 
Inadequate acquisition to conf: 
None 

Repeat of symbols; analyst can't 
greater than 3 pixels as some 1 

Multiple planting date for SG in< 
of interpretation. 

Dots still confuse; color separal 
feel comfortable with this one. 


The SG labels Include some flax \ 
crop in area. 

Problem with disjoint blobs and i 

Little confidence is place on soi 
tations; confusion with possib! 
of lates. 

None. 

Border around blobs not constant 
misleading. 


None. 

Lack of acquisition increases er: 
tween 26 April and 16 July) . 
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TABLE C-23. SUMMARY OF ANALYST COMMENTS 


SeRinent Analyst Other Comments Re Blob Patterns 


1518 

A 

B 

C 

Couple examples of disjoint blob pixels. 

Some mixed blobs, but less than some other 
segments. 


A 

- 

1566 

E 

- 


C 

- 


A 

- 

1602 

B 

- 


C 

Bigger strip field width fewer mixed blobs. 


A 

- 

1612 

B 

- 


C 

- 


A 

Mostly larger blobs; easier to work with. 

1619 

B 

- 


C 

- 


A 

- 

lo36 

B 

Should eliminate single pixel + small blobs. 


C. 

More mixed blobs than previously. 


i PART 6d or THE SEGMENT COMMENT FORM (Cont.) 
^ Other Comments 


Clouds and haze make interpretation of some signa- 
tures shaky. 

Some SG did not show usual signature trends (see 
analyst form for discussion). 

Some Impure and confusing signatures; signature pro- 
gression hard to follow; some blob ID as SG may be 
flax. 


Possibly 2 planting dates .some SG might -have been 
called summer crops. 

Field patterns hard to discern; need more blob 
acquisitions. 


Size of most strips in segment is as small as will 
produce a blob center. 

SG in segment shows classical temporal signatures 
which facilitated interpretation. 

Winter crops not separable; easy segment to inter- 
pret; good signatures; piece of cake. 



Since wheat was in strip area, fewer blobs labeled 
SG than expected. 

Some difficulty determining strip field limits; blob 
• colors a problem. 


Symbols look too much alike; hard to follow blob 
boundary. 

Good segment; high confidence level. 


Two different SG planting dates makes interpretation 
questionable. 

Need more FFC scale blob maps. 

Small blobs cause some confusion. 
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IT 6d OF THE SEGMENT COMMENT FORM (Cent.) 

Other Coinments 

Small strip fields ( satellite resolution) makes 
interpretation difficult. 

Natrow strip fields messed things up (see analyst's 
sheet) . 

Color of blob map and use of on computer map 
confusing. 

Blobbing didn't work as well because of narrow strip 
fields. 

Segment generally displays inhomogenous signatures; 
muttled appearance. 

Use "/" Instead of blob overaly colors bad; 

eliminate small blobs; use grid to check for over- 
looked blob. 

Small 'fields without blobs probably results in low 
SG estimate. 

Segment difficult due to small size, lack of dis- 
tinct field patterns of strips. 

Soma "SG" signatures follow a faster progression. 

Two apparent SG planting dates caused confusion. 

Confidence Level: Ave. Small fields, no clear 

field pattern. 

Difficult to Interpret because of irregular shapes/ 
poor field definition. 

Grain and non-grain signatures similar during summer. 


Small strip fields; poor blobs; segment is terrible 
to interpret. 

Poor segment; very low confidence rating; key acqui- 
sitions missing; high confusion factor. 
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TABLE C-24. SUMMARY OF ANALYST COMMENTS 




Percent 

Percent 


Segment 

Analyst 

Agriculture 

Small Grains 

Manor Crops 


A 

80-90 

Small 

Corn and Soybeans 

1380 

B 

98 

Relatively 

little. 

Corn and Soybeans 


C 

100 




A 

50-60 



1392 

B 

80 


Other 20% range- 
land. 


C 

60-70 




A 60-70 


B 85 

1457 

C 80-85 20-25% 2000 ac. winter, 

rest spring/ 
summer crops 

A 75-80 SG 

1461 B ■ 90 High SG 

C 90-95 


A 90-95 

1467 B 90 

C 100 


1473 


A 

B 

C 


90-95 

98 

100 


High % 


Wide Variety 
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TABLE C-24. SUMMARY OF ANALYST COMMENTS ON PART 7 OF THE SEGMENT COMMENT FORM (Cent.) 
Percent ' Percent 


Seament 

Analyst 

ARrlculture 

Small Grains 

Maior Crops 

■ Field Size 

TopoRraphv 


A 

• 50-60 


Large Forest Areas 

80-160 acres 


1518 ' 

B 





Smooth lake basin, poor drainage. 





small water, basin. 


C 

60 


30% pasture or 
scrub 

5-250; 60-80 ave. 

Level, no distinct drainage. 


A 

70-80 




Flat to gently rolling hills; some 

1566 






lakes, marshy areas; minor 
drainage . 

B 

90 

Hxgh 



Drift prairie; almost entirely 







glacial deposits. , 


C 

75-85 



Range: 10-140 ac. 

Glaciated; gently rolling; numerous 






ave. = 60 ac. 

lakes . 


. A 

40-50 



1/8-1/16 section 

Flat terrain; several large lakes. 


B 

75-80 




Irregular; gently rolling glaciated 

1602 

C 

75-80 



5-250 ac. strip 

plains. , morraine. 

Glaciated; moderately rolling, mor- 






1-8 pixels- 

raines, numerous kettles. 


A 

20-30 



40 acres 

Fairly flat; several large lakes. 

. 1612 . 

B 

40 




Drift' prairie: • glacial deposits. 

C 

45-55 



10 70-80 acres. 

knolls, depressions, some streams. 
Flat; low hills; numerous lakes. 



A' 

90 



1/4 section 

Flat. 

1619 

B 





Mostly smooth, -nearly level lake 

C 

‘ lob' 



Range: 10-200 ac. 
ave. =. 100-160 

basin. 

Gentle to flat. 




A 

75-80 



1/8-1/4 section 

Fairly flat; few low areas that 







hold water. 

1636 

B 

80 



Medium 

Drift prairie: glacial deposits. 







undulating plain. 


C 

100 



. 20 300 ac. 

ave. = 100-150 ac. 

Flat with scatter glacial deposits. 


A 

30-40 




Gently rolling hills, few small 


• 





lakes. 

1650 

B 

70 




Glacial deposits: rolling plain. 







some badlands top. 


C 

75-80 



5-125 acres 

Rolling terrain; some glaciation. 
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TABLE C-24. SUMMARY OF ANALYST COMMENTS ON PART 7 OF THE SEGMENT COMMENT FORM (Cont.) 


Percent Percent 

Segment Analyst Agriculture Small Grains Major Crops Field Size Topography 



A 

20-30 

20-30 acres 

Fairly flat. 

1653 

..B 

40 


Gently rolling; glaciated landscape. 


C 

15-20 

5-200 acres 

Undulating terrain; intermittent 





creek, lake. 


A 

20-30 

Small Amount SG 30-40 acres 

Flat area with agriculture; rest is 

1656 




fairly hilly. 

B 

20 

Small 

Extremely rugged; mostly natural 



85 


vegetation and rangeland. 


C 

3-60 acres 

Medium to heavy direction near 





Heart River; gentler in east. 


A‘ 

60-70 

100 acres 

Flat with trees, natural vegetation 





along a river. 

1825 

B 

80 


Generally smooth, nearly level lake 





basin. 


C 

55 

10-120 acres 

Level with gentle slope west; 





glaciated. 


A 

50 

Small 

Swampy area; hills; several lakes. 

1835 

■ B 

80 

Small 

.Wet area; several bodies of water. 

C 


10-70 acres 

. Heavily glaciated; numerous marshes 





and lakes. 


A 

20-30 

30-40 acres 

Agricultural area fairly flat; 

1920 




rest is rather hilly. 

•B 

50 

Medium 

Gently rolling glaciated plain with 





some kames and morraine. 


C 

30-40 ' 

Low Spring Grain Segment 10-20 acres 

Dissected in X7est; gently rolling 





to flat in east. 
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TABLE C-25. SUMMARY OF ANALYST COMMENTS ON PART 8 OF THE SEGMENT COUIMENT FORM 


Segment 

Analyst 

Did 

You Have to Change Your Procedure? 

Moisture? 

1380 

A 

Yes. 

Identified small grains blobs 
first, then non-small grains 

Adequate 


B 

Yes. 

Because of significant dis- 
tortions , Product 3 used 
extensively 



' C 

- 



1392 

A' 

No 




B 

No 




C 

No 



1457' ■ 

A 

Yes. 

Slightly because of many dis- 
continuous blobs 

Adequate 

- 

B 

No 




C . 

No 



1461 

A ■ 

No 


More than 
adequate 


B 

No 


Moist to wet 
throughout 


C 

No 



1467 

A 

No 




B 

No 



' 

C 

No 




Elevation 


2000-2300* 


Avg, '^'1600* 


(Page 1 of 4) 



107 


TABLE 

Segment 

1473. 

1518 

1566 


1602 

1612 


:-25. SUMMARY OF ANALYST COMMENTS ON PART 8 OF THE SEGMENT COMMENT FC 


Analyst 

Did You Have to Change Your Procedure? 

Moisture? 

A 

No 

Adequate 

B 

No 


C 

- 


A 

- 


B 

No 


C 

No. Just a little more cautious 


A 

No 


B 

No 

Area is 
generally wet 

C 


Drainage is 
deranged 

A 

No 

Adequate 

B 

No 


C 

- 

Climate semi-ai 

A 

No 


B 

No 


C 

Yes. Labeled non-spring grains, 
then spring grains 

Deranged 

drainage 



A=5' 


of 4) 
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TABLE 

Sesment 

,1619 


1636 

1650 


1653 


1656 


J.-25. SUMMARY OF ANALYST COMMENTS ON PART 8 OF THE SEGMENT COMMENT FORM -(Cont.) 


Analyst 

A 

B 

C 


Did You Have to Change Your Procedure? Moisture? 


Yes. Colored obvious non-grain, then 
obvious grain 

No 

’No 


No apparent 
water problem 

Poor drainage 

'Drainage den- 
dritic, but 
intermittent 


Elevation 


900' 


1 = 0 -' 


A 

B 

C 


No 

No 

No 


Deranged drainage 1465' 


A 

B 

C 


No 

No 

No 


Adequate 

Relatively dry 

Dendritic 

drainage 


2835’ 


A 

B 

C 


No 

No. 

Yes. 


Product 3's removed (confusing); 
blob map used only w/ Product 1 


Adequate 


No rain noticed 1950' 


A 

B 

C 


No 

No 


1750-2350' 
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ON PART 8 OF THE SEGMENT COMMENT FORM (Cont,) 


Your Procedure? Moisture? Elevation 

No apparent ' 
problem 

Wet in Spring 800-1000’ 

Adequate 1000-1310' 

Adequate 
Wet area 

1325' 


2100-2450’ 
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C.4.2 PARAMETERIZED PERFORMANCE OF PROCEDURE M 


The variance and bias of Procedure M are viewed as functions of 
the parameters which control sample size, type of stratification, number 
of strata, and source of labels. Methods of estimating the bias and 
variance of Procedure M’ s segment spring small grain (wheat/barley) 
proportion estimate follow.. 


The bias from sampling only big blobs is the difference in the 
ground truth proportions of grain within the big blobs and of grain in 
the complete segment. 


Given a fixed segment, source of labels and stratification denote 
the label given the jth blob of stratum s as C,^. The bias due to the 
source of labels is 


where 


BCLUST N 


b(P) = E 

s=l 


N 


s N. 

E ^ C. -P 

jtl N.s js s 


C; 

JS 


N. = 
JS 

N 

.s 


1 if label is grain, C. =0 otherwise, 

J s 

number of pixels in blob j of Stratum s, 
number of big blob pixels in Stratum s. 


N = number of big blob pixels in segment. 


P = ground truth proportion of grain in big blobs. 


and given a sample S - ^]^bS 2 US^U< • chosen using the Midzuno 

A 

sampling technique, P is the sample proportion, namely 
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2pi 


E N. c. 


BCLUST N ^ / jeS 


p= L 

s=l 


.s 


N I y; N. 

• • ' • o J S 


If n is the s stratum sample size, and M is the number of blobs 

3 S 

in the s stratum, then there are I I possible samples from Stratum s 

of size n which could be obtained. ¥e denote these by 
s 


S . , i=l,2,3, • . . , , 
si’ > ’ ’ ’ Vn* 


The mean square error of 


p= E 


BCLUST N g ^ 

y— P 
, N s 
s=l 


xs 


'M 


(*) MSE(P) = 


BCI.UST 



E 1 


E 

S=1 

i=l 


E N. C. 
j7s. 

xs 


jeS. N. 

‘ IS j s 



where 


P = 


E N. C. 

jsS. 

xs 


E 

jeS 


N. 

xs 


xs 


The-, variance of P is 


V(P) = MSE(P) - b (P), 
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A proof of (*) can be found in [ ] . 

Examination of the bias and variance of Procedure M as a function 
of type of strata, number of strata, sample size, and source of labels 
follows. 

Denote the bias , mean square error and variance of Procedure M 
proportion estimate for Segment j as 

bj(a,N,n,g), MSE^ (a,N,n, 6) , and (a,N,n, 3) 

where 

a = static trajectory strata, BCLTJSTER, or tolerance blocks, 

N = 40, 50, or 60, 
n = 80, 100, or 120, 

3 = analyst green, red, blue, vote, average, or ground truth. 
The sampling reduction of variance factor is 

(g,N,n,3) 

V^(-,l,n,3) 

The effect of sampling strategy will be studied by examining 

1 17 • 

V (a, N,n, ground truth) = yt I V. (a, N,n, ground truth) 

j=l ^ 

for different combinations of a, N, and n given above. 

The effect of the source of labels will be studied by examining 

V (BCLUSTERs,40,100,3) - V (BCLUSTERs, 40, 100, ground truth) 
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for 

P = analyst green, 

= analyst red, 

= analyst blue, 

= vote, and 
= average label. 

C.4.3 EVALUATION OF THE BCLUSTER, BLOB, AND MACHINE LABELER 
COMPONENTS 

Evaluation of the BCLUSTER : Because of a delay in obtaining ground 

truth for the Minnesota segments, the evaluation of the BCLUSTER com- 
ponent is performed' only on the 13 North Dakota segments. A measure of 
the value of a stratification is the ratio of .the stratified variance 
to the unstratified variance. This measure is called the reduction of 
variance (RV) factor. Table C-26 gives the RVs for' sampling with replace- 
ment.- The average RVs for 20, 40, and 60 BCLUSTERs are .637, .544, and 
.483, which are close to what was obtained in last year's experiment. 

In the case where the sampling is with replacement we -have 

0 .< RV £ 1. 

This is not always the case if the sampling is without replacement. The 
stratum finite correction factors have less effect on the stratum variance 
than on the unstratified variance. • The methods in which true RVs will be 
computed are outlined in the last section for the case where sampling 
is without replacement (used by Procedure M) . 

Table C-27 gives the number of BCLUSTERs broken down by their grain 
purity for 20, 40, and 60 BCLUSTERs. The number of blobs, number of 
pixels, percentage of BCLUSTERs, percentage of blobs, and percentage 
of pixels are also given in Table C-27 . 
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TABLE C-26. REDUCTION OF VARIANCE (NORTH DAKOTA SEGMENTS) 


Number of BCLUSTERs 


Segment 

20 

40 


1392 

0.709 

0.479 

0.422 

1457 

0.562 

0.516 

0.441 

1461 

0.715 

0.504 

0.482 

1467 

0.723 

0.683 

0.621 

1473 

0.441 

0.335 

0.315 

1602 

0.463 

0.434 

0.371 

1612 

0.701 

0.654 

0.627 

1619 

0.490 

0.342 

0.292 

1636 

0.705 

0.616 

0.469 

1650 

0.728 

0.634 

0.616 

1653 

0.657 

0.613 

0.500 

1656 

0.731 

0.652 

0.541 

1920 

0.654 

0.608 

. 0.577 

R 

0.637 

0.544 

0.483 
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TABLE C-27. NUMBER OF BCLUSTERs BY GRAIN PURITY 


20 BCLUSTERs 



Number 

Number 

Number 

Percent 

Percent 

Percent 

Percent 

of 

of 

of 

of 

of 

of 

Grain 

BCLUSTERs 

Blobs 

Pixels 

BCLUSTERs 

Blobs 

Pixels 

0-10% 

127 

1758 

93203 

41.64 

36.18 

38.23 

10-20% 

16 

544 

30484 

5.25 

11.20 

12.50 

20-30% 

16 

301 

11241 

5.25 

6.19 

4. 61 

30-40% 

9 

289 

15771 

2.95 

5.95 

6.47 

40-50% 

7 

68 

2328 

2.30 

1.40 

0.95 

50-60% 

6 

39 

1653 

1.97 

0.80 

0.68 

60-70% 

12 

639 

32262 

3.93 

13.15 

13.23 

70-80% 

14 

412 

18157 

4.59 

8.48 

7.45 

80-90% 

12 

431 

22640 

3.93 

8.87 

9.29 

90-100% 

86 

378 

16043 

28.20 

7.78 

6.58 


40 BCLUSTERs 


0-10% 

261 

1817 

91250 

49.81 

-37.04 

37.43 

10-20% 

23 

518 

31202 

4.39 

10.56 

12.80 

20-30% 

24 

503 

25105 

4.58 

10.25 

10.30 

30-40% 

14 

208 

9092 

2.67 

4.24 

3.73 

40-50% 

16 . 

122 

4627 

3.05 

2.49 

1.90 

50-60% 

13 

117 

7266 

2.48 

2.38 

2.98 

60-70% 

13 

173 

7353 

2.48 

3.53 

3.02 

70-80% 

19 

263 

12403 

3.63 

5.36 

5.09 

80-90% 

17 

288 

13792 

3.24 

5.87 

5.66 

90-100% 

124 

897 

41692 

23.66 

18.28 

17.10 


60 BCLUSTERs 


0-10% 

386 

1987 

101122 

49.61 

40.35 

41.49 

10-20% 

33 

545 

29748 

4.24 

11.07 

12.20 

20-30% 

26 

324 

15611 

3.34 

6.58 

6.40 

30-40% 

11 

84 

2968 

1.41 

1.71 

1.22 

40-50% 

23 

. 186 

9496 

2.96 

3.78 

3.90 

50-60% 

16 

188 

7491 

2.06 

3.82 

3.07 

60-70% 

18 

111 

4992 

2.31 

2.25 

2.05 

70-80% 

21 

202 

10242 

2.70 

4.10 

4.20 

80-90% 

25 

381 

18054 

3.21 

7.74 

7.41 

90-100% 

219 

917 

44031 

28.15 

18.62 

18.06 
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Figure C-12 gives the -graph of the percentages of BCLUSTERs within 
each grain percentage class for 20 and 40 BCLUSTERs. The graph of 60 
BCLUSTERs is very close to that of 40 BCLUSTERs. -The differences be- 
tween the number of BCLUSTERs in grain percentage classes is not sig- 
nificant. 

However, when the number of big blobs and number of pixels within 
those BCLUSTERs are graphed (Figures C-13 and C-14) we see that the 
stratification obtained with 40 and 60 BCLUSTERs is much better than 
that obtained with 20 BCLUSTERs. The difference between the 40 BCLUSTER 
and 60 BCLUSTER stratification is small and perhaps the stratum finite 
correction factor would reverse this ordering. 

Evaluation of the blob component ; The purpose of the blob com- 
ponent is to cluster pixels into field-like forms which are relatively 
pure grain or non-grain. These blobs with interior pixels are sampled 

and labeled by analyst. The labeling results are given in Section 

of this appendix. The analyst subjective evaluation of the blob com- 
ponent can be found in this section of the appendix. It was shown that 
those blobs with grain percentage between 25% and 75% were labeled wrong 
more often than those outside of this interval. Figure C-15 gives the 
percentage of blobs at each of the grain percentage levels 0-10%, 

10-20%, 20-30%, 30-40%, 40-50%, 50-60%, 60-70%, 70-80%, 80-90%, and 
90-100%. ' We note the vast majority of the blobs are less than 25% 
grain or greater than 75% grain. If the blob grain percentage is com- 
puted using only the interior of the blobs the results are even more 
striking. Figure C-16 gives the percentage of pixels within the blobs 
in the same grain levels given above. The two graphs are extremely 
similar because the grain percentage and the- blob sizes are nearly 
independent in these 17 segments. 
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FIGURE C-12. PERCENTAGE OF BCLUSTERS VS. PERCENT GRAIN 
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Percent of BCLUSTERS 



FIGURE C-14. DISTRIBUTION OF THE NUMBEI 



Percent of BCLUSTERS 



Percent Grain 


FIGURE C-15. PERCENT OF BIG BLOBS VS. PERCENT GRAIN 


Percent of BCLUSTERS 






Table C-28 gives for each segment, the number of blobs, the number 
of big blobs, the percent of segment covered by big blobs, the blob 
purity, the reduction of variance factors, and the information theoretic 
factor discussed in Appendix K and Section 6. 

Evaluation of the machine labeler : The machine labeling component 

uses Brightness/Greenness values to give wheat or barley labels to all 
of the targets which are labeled grain and have at least three acqui- 
sitions falling in the time interval between plant emergence and har- 
vesting. 

The performance of the labeler will be evaluated both as a classifier 
and as a barley proportion estimate. The percentage of ground truth 
grain targets correctly classified will be computed for each segment. 

The bias, variance, and root mean squared error (RMSE) of the barley' 
proportion estimate will be computed for each segment. 

The relationship between analyst errors and the machine labeler 
errors will be studied by evaluating the machine labeler's performance 
on grain targets missed by the analysts and non-grain targets labeled 
grain by the analyst. 
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TABLE C-28. SEGMENT QUASI-FIELD STATISTICS 


Percent of 


Segment 

Blobs 

No- Big 
Blobs 

Segment 

Covered 

Blob 

Purity 

Blob 

RV 

Blob 

ITF 

1380 

1633 

403 

65.64 

93.07 

.40 

.85 

1392 

688 

380 

87.6 

92.5 

.20 

.76 

1457 

900 

401 

84.1 

85.5 

.15 

.81 

1461 

1506 • 

465 

71.2 

96.9 

.08 

.92 

1467 

554 

400 

91.4 

89.2 

.32 

.67 

1473 

1353 

387 

72.6 

97,6 

.06 

.95 

1518 

819 

420 

87.38 

88.22 

.27 

.93 

1566 

1264 

461 

68.39 

86.97 

.11 

.83 

1602 

1298 

399 

78.5 

‘ 97.0 

.69 

.90 

1612 

577 

329 

90.8 

94.0 

.22 

.78 

1619 

916 

379 

79.5 

96.6 

.09 

.89 

1636 

1660 

427 

80.1 

93.1 

.20 

.75 

1650 

862 

397 

81.3 

85.7 

.44 

.58 

1653 

613 

339 

89.0 

96.6 

.19 

.85 

1656 

631 

322 

87.5 

95.6 

.19 

.83 

1825 

1114 

■455 

80.79 

89.17 

.34 

.82 

1920 

436 

299 

6663 

99.2 

93.8 

.20 

.80 

Ave. 


392 

79.47 

93.03 

.244 

.8182 
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APPENDIX D 

SUPERB — AN IMPROVED SPECTRAL-SPATIAL 
CLUSTERING ALGORITHM 


D.l INTRODUCTION 

SUPERB represents a next generation of spectral-spatial clustering 
beyond ERIM's algorithm BLOB. The new algorithm takes its name from 
"supervised BLOB". As with BLOB, the purpose is to divide a scene into 
patches (which we call quasi-fields or blobs) of adjacent or nearly 
adjacent pixels that are relatively homogeneous spectrally, and to a 
reasonable extent, represent the fields present within the scene. 

Spatial-spectral clustering has been successfully used to specify 
labeling targets for analysts, to provide a head start for spectral 
clustering, to flag mixture or edge pixels, and to form a basis to 
"compress" a scene (forming quasi-field spectral mean values and 
substantially reducing the data volume). The extent to which the 
purpose of spatial-spectral clustering has been achieved was discussed 
in previous reports [ 3 ] , although more work needs to be done to 
examine SUPERB. 

Uhat follows is a description of the basic algorithm of SUPERB, 
and a discussion of its special features. As a summary, the principle 
new features of SUPERB are given in Table D-1, along with the expected 
desirable effects of using those features . These features represent 
the main differences from algorithm BLOB. 

D.2 ALGORITHM DESCRIPTION 

D.2.1 GENERAL 

To help understand the system environment, we mention that SUPERB 
is written in an extended FORTRAN ("Overdrive") as a module within ERIM’s 
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TABLE D-1. DEVELOPMENT HIGHLIGHTS OF SPATIAL-SPECTRAL CLUSTERING PERFORMED 
BY ALGORITHM SUPERB 


Development 

Significance 

Ground truth supervision is available as 
an option. 

Formation of quasi-fields can be forced 
consistent with ground truth boundaries. 

The resulting blobs can be used as ground 
truth fields, in some applications. 

A new distance measure allows separate 
consideration of spatial and spectral 
effects. 

It is easier to judge the physical signi- 
ficance of threshold parameters values. 

A pixel may be assigned only to a quasi- 
field associated with a neighboring pixel 
(or form a new quasi-field) . 

Quasi-fields now can be made to consist of 
a single group of adjacent pixels. 

Fewer distance calculations need to be 
performed . 

When deciding whether to assign a pixel 
to a blob versus form a new quasi-field, 
pixels having the easiest decisions are 
processed first (within each scan line) . 

More candidate neighbor quasi-fields often 
become available to help improve later 
decisions. 

Biasing effects of sequential processing 
order are minimized. 

Newly formed blobs are compared to their 
neighbor blobs, and combined if suf- 
ficiently similar. 

If two blobs happen to start in the same 
homogeneous field, they may be combined. 




QLINE system, QLINE is a modular multlspectral software architecture 
in which a module is called (usually) once for each scan line in a 
scene (the module is called for various kinds of initialization and 
post-processing as well) . A scan line of data is made available to 
the module in the form of a two-dimensional array (channels by pixels 
in the scan line). The module may use or modify the data, for example, 
by adding one or more channels representing processed data. 

Having discussed the environment in which SUPERB exists, we now 
describe the processing carried out for each scan line. First, the 
pixels are processed sequentially right to left. A distance measure, 
one of those selected from Table D-2, is computed between the pixel 
and each candidate blob. The candidate blobs are all blobs that were 
assigned to pixels neighboring the pixel being processed. (This neigh- 
borhood can be modified by a user to any subset of pixels up to two 
units away, but normally consists of five neighbors — upper left, upper, 
upper right, right, and left. On this first pass, the left neighbor, 
and sometimes the right, would not yet have an assigned blob.) For this 
pixel, the spatial-spectral distance D to the nearest candidate blob is 
compared to a pair of thresholds d^ and d^. If D d^, the pixel is 
assigned to the blob and is used to update the stored blob mean vector. 
If D dj^, the pixel forms a new blob. However, if D is between d^ 
and the decision is delayed, and the pixel is placed on a deferred 
list along with its value of D. 

Once all pixels are processed in this manner, the list of deferred 
pixels is sorted in order by descending values of ]c-d^[, where d^ is 
the final decision threshold (d^ • Each of these pixels are 

processed in sorted order by computing distances from the pixel to a 
new set of candidate blobs. (There may be more candidate blobs during 
the second try for a given pixel, since its neighbors may have been • 
assigned to blobs.) The new smallest distance D is compared to thres- 
hold d^. If D > d^, the pixel forms a new blob. Otherwise, it is 
assigned to the nearby blob with distance D. 
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Of the three distance measures described in Table D-2, the first, a 
composite of spectral and spatial measures, is one of the options in BLOB. 
The second is a pure spectral measure, while the third is the new dis- 
tance vector consisting of two elements (D^ and D^) representing spectral 
and spatial components, respectively. In the last case, the three deci- 
sions given in the above paragraphs must be specified in terms of and 
D^, as shown in Table D-2. In this case, deferred pixels are sorted in 
descending order of + D^. 

The’ foregoing discussion covers the basic mechanism, when no special 
conditions or options are present. What follows is the discussion of 
several features and conditions that can modify the basic picture some- 
what. 


D.2.2 GROUND TRUTH SUTERVISION 

If available, ground truth can be used to constrain the algorithm 
so that each blob must consist of pixels of a single ground class. 

In one of two supervision options, the ground truth is supplied 
in the form of an integer code for each channel. When distance mea- 
sures are computed between a pixel and its neighbor blobs, only blobs 
with matching ground truth code are considered. Furthermore, codes of 
zero are used to mark pixels as mixture or unknown, and such pixels are 
assigned to a single garbage blob. 

In the second supervision option, the ground truth is supplied 
in the form of several subpixel codes for each pixel, combined with 
one code that summarizes the several. The summary code is zero if 
the subpixel codes do not all agree, and is the same as the subpixel 
codes if they agree. In this option, the candidate blobs for a given 
pixel are neighbor blobs whose ground truth code matches at least 
one of the subpixel codes for the pixel. When a pixel forms a new 
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TABLE D-2. DISTANCE 'MEASURES THAT MAY BE SELECTED IN SUPERB 


Decision Rules 


’^1 


n . r« - I (p - 

2 L ■'* _ 


Assign Pixel 
to Nearest 
Blob, 
if* 





^2 A 


Pass 

Deferred Pass 

Form New 
Blob, 
if* 

Assign Pixel 
to Nearest 
Mob, 
if** 

D > dj, 

D :< dj 

D > d, 
— . h 

D £ 








^2 - 



Otherwise defer the decision. 


■ Table Notes ; 

D “ overall pixel~to-nearest-blob distance 
Dj^ = spectral pixel-to-nearest-blob distance 
D 2 “ spatial plxel-to-neacest-blob distance 
d^ = low threshold 

- low threshold for spatial 
dj^ = high threshold 
Sj^ = high threshold for spatial 
d, - final threshold 


Otherwise form 
new blob. 


Sg ”= final threshold for spatial 


^ allowed variance in channel i 
Vj^ = allowed variance in line coordinate 
Vp =• allowed variance in point coordinate 
x^ “ pixel value in channel 1 
“ blob mean value in channel i 
S, => line coordinate of pixel 
1 = mean line coordinate in blob 
p = point coordinate of pixel 
■p “ mean point coordinate in blob 
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blob, the ground truth code for the blob is the most frequent subpixel 
code in the pixel. If there is a tie for most frequent subpixel code, 
the one-pixel blob is considered "unlab elable", and is combined with 
the garbage blob. If the most frequent subpixel code is zero, the 
one-pixel blob is combined with the garbage blob. 

TJhichever supervision option is selected, the function is to limit 
the set of neighbor blobs that are considered in computing distance 
measures to (and therefore possibly being combined with) each pixel. 

An additional effect of supervision is that ground truth mixture pixels 
will not form new blobs except during deferred pixel processing. This 
may reduce the chance of a mixture pixel that results in an unlabelable 
one-pixel blob, since a neighboring pure or less ambiguous pixel may 
initiate a blob that would absorb the mixture pixel. A second additional 
effect is that, for each pixel, a result ground truth code is produced 

that reflects the ground truth code of the blob containing that pixel. 
This code may be a stronger indication of the pixel’s primary class than 
the ground truth codes themselves, since slight misregistrations of the 
ground truth can be overridden for mixture pixels by spectral similarity 
to neighbor classes. 

D.2.3 USE OF SCREEN INFORMATION 

Algorithm SCREEN [ 36 ] is designed to examine Landsat data and 
identify those pixels which probably aren’t good agricultural data, 
such as clouds, water, etc, and this can be carried out for each 
acquisition in multitemporal data. When SUPERB is run, it can 
examine information from SCREEN and tell, on a pixel-by-pixel basis, 
which channels contain valid data,,. Usually, all channels are valid, 
in which case SUPERB works normally. 

The behavior is modified, when some channels are not usable, as 
follows. The spectral part of the distance measure is computed as: 
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(== 1 -^ 1 ) 


i=-|good channels} V. 


X 


(number of channels specified) 
(number of good channels) 


where the good channels are those specified that are not flagged by 
SCREEN for the pixel, and also are not flagged in SCREEN information 
maintained for the blob. 

When a pixel is used to form or update a blob, care is taken so 
that channels flagged (as clouds, etc) for the pixel do not change 
the blob mean in those channels, and channels flagged for the blob 
cannot be updated but only replaced by unflagged pixel channels . 

There must be at least a specified minimum number of non-flagged 
channels in a pixel, or the pixel will be assigned to the garbage 
blob. It is occasionally possible for a finished blob to occur with- 
out any valid data in some channels, in which case those channels 

i 

would contain bad information (namely the values for the first pixel 
assigned to the blob). These are indicated with a warning message, 
but currently are not discarded or otherwise marked. 

D.2.4 BLOB NEIGHBOR TABLE 

Algorithm SUPERB as an option can prepare a table that lists for 
each blob, a list of blobs that neighbor it. The definition of neigh- 
borhood used to form this table is the same as the neighborhood des- 
cribed previously for establishing candidate blobs for a pixel when 
determining the pixel's assignment. A blob is adjacent to another if 
any pixel of one blob is in the neighborhood of any pixel of the other 
blob. The resulting table can be written and stored for use by blob 
mapping algorithms and other analysis routines that work with blobs. 
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D.2.5 BLOB COMBINATION OPERATIONS 

Occasionally, it can happen that a group of adjacent spectrally 
similar pixels which normally would comprise a single blob, may be 
started from two sides, resulting in two blobs rather than one. One 
way this can happen is that the upper corner of a field on one scan 
line contains a pixel that starts a blob, then on the next scan line, 
a pixel on the opposite side of the same field starts a blob also. 

A first attempt to reduce this possibility was based on the fact , 
that rectangular fields (in the U.S.) tend to be rotated a few degrees 
counter-clockwise to the scan line direction, so that the upper corner 
of a field normally belongs to the right-hand side of a field. When 
the order of processing was changed from lef t-to-right to right-to-left, 
the problem of multiple starts was reduced, but still occurred for some 
field .geometries. 

In order to more fully solve the problem of multiple starts , an 
option was implemented to consider combining adjacent blobs. After 
all pixels on a scan line are assigned, any newly formed blob is com- 
pared to its neighboring blobs, taken from the blob neighbor table. 

Using the mean of a newly formed blob as if the blob were a pixel, 
the distance function is computed to each neighbor blob (assuming that 
if supervision is in effect that the blob ground tru.th codes match) . 

If the nearest blob is sufficiently close (using thresholds d^ and 
which may but need riot be the same as d^ and s^ as discussed above) , 
the blobs are combined, taking the same care with SCREEN information 
that is used when assigning a pixel to a blob. 
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APPENDIX E 

LOGIC POR PLACEMENT OF LANDSAT DATA INTO COLOR IMAGERY 
E.l INTRODUCTION 

A first principle in design of imagery is that independent, impor- 
tant features of the data should translate to independent dimensions of 
color variation as perceived by the human eye. In this way, one allows 
the visual "information processor" to correctly come into play in ana- 
lyzing the image. Imagery acts as the interface between the data and 
the human being, tying us into a psychological processor which is quite 
impressive for its pattern recognition capabilities. To best use this 
processor it is important to understand and respect its inherent manner 
of classifying a scene, literally its way of looking at things. This 
appendix describes where these considerations have led us in designing 
imagery for Landsat data, and what logical options appear to be available 
to us, given our current understanding of the data structure. 

E.2 MAPPING DATA FEATURES TO COLOR VARIATION 

It is possible to place more than one component of information 
about a scene into a color image, and still have those components inde- 
pendently apparent to and assessible by an observer. This is possible 
because perception of color is multidimensional. Colors are perceived 
to vary in three independent characteristics or along three independent 
dimensions. If we control each independent way of changing a color 
with a different information component then the information components 
remain independently apparent and assessible. 

The importance of making our information components control inde- 
pendent dimensions of color is not hard to see. Under this arrangement 
we are properly linking up with the visual information processor. The 
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observer (image analyst) perceives the information components in the 
scene as though the human eye had evolved so as to pick them out. The 
effort involved in interpretation of color is minimized because much 
of the work of classification occurs immediately in the visual system. 
The Information is presented in the form required for pattern recogni- 
tion by the observer. 

The predominant dimensions of color variation are three in number. 
They bear the names hue, saturation and lightness and are defined as 
follows [53]: 

Hue is the attribute of a color perception which gives 

rise to the names red, purple, yellow, etc. 

Saturation is the attribute of a color perception determining 
the degree of its difference from the achromatic 
(grey) color perception closest to it, i.e., of 
the same lightness. 

Lightness is the attribute of a color perception permitting 
it to be classed as equivalent to some member of 
the series, of achromatic color perceptions ranging 
from black to white (grey levels). 

The attributes of hue, saturation and lightness are subjectively defined 
and are not predefined in any analytic or quantitative sense. 

A second important concept in mapping data variation into color is 
the perceptual scaling of color differences. It is possible to find 
a sequence of colors between two given colors such that the perceived 
size of the color change from step to step is uniform, i.e., the same 
at each step. Naturally, we wish the distance relationship between data 
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points to be preserved for the observer of the image. The way to do 
this is by mapping the data interval onto a uniform color scale of the 
sort described. In this manner, a given distance in the data interval 
is consistently transformed to color differences of the same size. This 
is important since large deviations from uniform transformation will 
certainly give us a distorted view of data variation in our image . 

In order to portray more than one variable in color while main- 
taining uniform distance translation we need an extension of the con- 
cept of color uniformity to more than one dimension. The single dimen- 
sional uniform color scale gives rise to the concept of placing colors 
in two- or three-dimensional configurations for which distance between 
any colors still correlates to the psychologically perceived difference 
between those colors. The Munsell color specification system is an 
attempt to set up such a configuration for all object colors. The con- 
figuration was arrived at empirically and is defined by cards of color 
chips. Many attempts have been made to describe the desired spacing of 
colors mathematically. The usual starting point is the X,Y,Z color 
space. Transformations are made on these variables to arrive at a 
color space which is more uniform to the eye's judgement of color dif- 
ferences. Currently the L*ja*,b* color space is the standard Uniform 
Color Space designated by CIE. Its relatively simple equations obtain 
a color spacing in good agreement with the Munsell system. 

E.3 DIFFERENT COLOR SPACES, THEIR UNIFORMITY AND THE LOCI OF HUE, 

SATURATION AND LIGHTNESS 

E.3.1 X,Y,Z COLOR SPACE [53] 

We do not perceive a unique color for every different light stimu- 
lus presented to the eye. For example. Figure E-1 shows 12 reflectance 
curves which are markedly different but which nonetheless give rise to 
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FIGURE E-1 [53]. TWELVE SPECTRAL RESPONSE FUNCTIONS OF OBJECT. COLORS 
METAMERIC WITH RESPECT TO CIE SOURCE C AND THE 
1931 CIE STANDARD OBSERVER 
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the same color perception. The reason for this is that the eye has a 
finite number of "sensor channels", in fact it has only three. Three 
pigments, located in the cones of the retina, are involved in color 
vision. Each pigment has a different spectral sensitivity to light so 
each acts like a sensor channel with a different bandpass.- Many differ- 
ent spectra can produce the same color sensation as long as the trio of 
channel values, referred to as the "tristimuli’s values", end up the same. 

A set of spectral sensitivity curves for the eye's three receptors 
have been defined for a standard or ideal observer, based on results of 
color matching experiments. These three weighting curves appear in 
Figure E-2. They are labeled b^, g^ and and are referred to as color 
matching functions. In 1931, the Commission Internationale d'Eclairago 
or CIE set down a linear combination of the color matching functions with 
certain specially desirable properties, to be the basis of a standard 
color space. The color matching functions used by the CIE are shown 
in Figure E-3 and are labeled x^, y^, z^. 

When these weighting curves are applied to spectra, the three re- 
sulting numbers are called tristimulus values and labeled X,Y,Z. In 
constructing the weighting functions x^, y^, z^ the constraint was made 
that y, would have the same shape as the photopic (cone vision) luminous 
efficiency function. Thus the Y tristimulus value does double duty — 
it also is a predictor of color lightness. Empirical tests have shown 
that Y is a good, approximate predictor of perceived lightness (in the 
sense of lightness matching between colors, not in the sense of uniform 
spacing of color lightnesses), but has a small systematic problem, namely 
underestimating the lightnesses of saturated colors. The additional con- 
straint was made that the x^, y^, z^ weighting or color-matching func- 
tions would be everywhere non-negative. Since spectra are non— negative, 
this implies the X,Y,Z tristimulus coordinates will always have non- 
negative values. 
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FIGURE E-2 [ 53 ]. COLOR-MATCHING. FUNCTIONS, 

IN THE PRIMARY SYSTEM R = 700.00 nm, 
G = 546.1 nm, B = 435.8 nm 
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FIGURE E-3 [ 53 ]. COLOR-MATCHING FUNCTIONS OF THE 
1931 CIE STANDARD OBSERVER 
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In X,Y,Z space, colors of fixed chromaticity (hue and saturation) 
fall on lines which emanate radially from the origin. Hue and satura- 
tion are determined by the ratio of the X,Y,Z tristimulus coordinates 
and a constant ratio among the coordinates is found on any line out 
from the origin. To see why- this is so, imagine the following experi- 
ment. Light is projected onto a screen from a white light source after 
passing through a given, fixed color filter. The white light source has 
a dimmer switch so we can vary its output. As we vary the source from 
dim to bright the perceived lightness of the projected, filtered light 
will change correspondingly; however, its perceived chromaticity, i.e. , 
hue and saturation, will remain the same. Now if we calculate the X,Y,Z 
tristimulus values for the color at several different lightnesses we will 
find the magnitude of the vector varies, but not its direction. This is 
a consequence of the fact that the X,Y,Z measurements respond linearly 
to changes in energy. If you double the energy of a spectra, or if you 
half it, each of the X, Y, and Z values changes correspondingly and 
their ratio remains unchanged. Thus the locus of points in X,Y,Z space 
which could be produced in this experiment fall on a line out from the 
origin with direction determined by the filter used. 

Finding the locus of constant hue with variable saturation is not 
so simple. Neither is finding the locus of constant saturation with 
variable hue. Isolating hue or saturation is a psychological judgment 
without a simple mathematical correlate in X,Y,Z space. Lines of con- 
stant hue and lines of constant saturation have been empirically mapped. 

A typical result of this mapping is shown in Figure E-4, plotted in the 
x,y chromaticity diagram for a fixed lightness. Notice that hue tends 
to be radially distributed while saturation tends to be cylindrically 
distributed. 
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FIGURE E-4 [ 53 ]. 1931 CIE CHROMiTICITY DIAGRAM SHOWING LOCI OF CONSTANT 

HUE AMD CONSTANT SATURATION AT LIGHTNESS VALUE 3/ OF 
MUNSELL RENOTATION SYSTEM 
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E.3.2 L*,a*,b* COLOR SPACE 

If the eye gauged differences between colors linearly with respect 
to differences in energy of the light stimuli, then the X,Y,Z color 
space would be an exactly uniform color space. However, the eye does 
not judge color differences linearly with respect to energy differences. 
Empirical findings show that the psychological magnitude of color dif- 
ferences are a function of energy differences under transformation by a 
logarithm or fractional power less than 1/2. One formula does not work 
under all conditions because the background of the colors being compared, 
i.e., the adaptive state of the observer’s eye, is an important factor. 

A logarithmic transformation works best when the background is a color 
in between the colors being judged. A cube root transformation works 
best given a constant middle grey background (reflectance of 20%). A 
square root transformation works best given a white background for all 
observations. As a standard transformation, the cube root formula has 
found favor. It has found a place in the Munsell color specification 
system and the equations of L'*,a’%b* Uniform Color Space. 

The L*,a",b'' standard Uniform Color Space is built from the X,Y,X 
color space in a few straightforward steps. First, coordinates are nor- 
malized to the nominal white of your particular color production hard- 
ware. Denote the tristimulus values of "white" by X ,Y ,Z and normalize 
all other coordinates by forming the ratios X/X^, Y/Y^ and Second ,- 

take the cube root of each coordinate axis. This step Incorporates the 
understanding of psychological equispacing discussed above. Third, a 
linear transformation is applied to the axes to line them up with what 
are believed to be the directions of chromatic balance which the eye 
uses in differentiating color. These poles of chromatic balance are 
blue yellow and red ys^. green. The a* coordinate labels the red/green 
balance while b* labels the blue/yellow balance. The transformation, 
thus far described, has the following form: 
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The fourth and final steps of the transformation weights the coordi- 
nates to obtain a color spacing more correlated with psychological' judg- 
ment of color differences. The complete transformation in its accepted 
form appears as follotvrs: 

L* = 25 (100 -Y/Y - 16 

o 

a* = 500 

b* = 200 [(Y/Y^)^^^ - (Z/Z^)^^^] 

What should L'',a*jb'* space look like In principle? Where are 
colors located and in what manner are they organized, as far as the 
psychological dimensions of color perception are concerned? The pre- 
dominant psychological dimensions of color are lightness, hue and satu- 
ration. We will review the structure of L",a ,b color space with respect 
to each of these dimensions. 

L" depends solely on the Y tristimulus value. The Y value predicts 
color lightness to a fair approximation. The L ’ function only acts to 
spread out the lightness measure so that 1) it is more nearly uniform 
to psychological distance estimation and 2) its scale is approximately 
commensurate with the scales of the chromaticity axes, a and b , 
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Hue in L'',a”',b'" space is not well defined and can be held constant 
only to an approximation. Chromaticity in L*,a*,b* space is specified 
by the two dimensional, rectangular coordinates a* and b*. As mentioned 
above, this pair of coordinates divide up chromaticities by finding the 
balance of the color on two scales ■ — a blue to yellow scale and a red 
to green scale. In principle, hue should be preserved in moving radi- 
ally out from the L“ axis as illustrated in Figure E-5 (a) . As long as 
the ratio of the chromaticity scales is maintained, hue should approxi- 
mately be maintained. If L'',a*,b* space was truly a Uniform Color Space 
this would have to be true — in order for the straight line between 
two points of the same hue and on the same radial line to be the geo- 
desic or minimal distance path between them, no hue variation could 
occur along the path. However, L*,a",b* space is not, even in principle, 
a truly Uniform Color Space. L ,a ,b ' space is only an approximation 
to uniform scaling. It was selected as a standard because it worked 
about as well as other spaces (in matching empirical findings for small 
color differences) and, at the same time, was mathematically very simple. 
What this means is that hue need not be precisely radially distributed 
out from the L” axis. If the exponent of the L*,a*,b" transformation 
were changed, say to 1/4 or 1/2 or 0.4, then the colors which are associ- 
ated by virtue of being in the same radial direction would change. As 
discussed above, these values of the exponent would not be unreasonable — 
there is nothing magic about the exponent 1/3. Therefore, the loci of 
constant hue are not well defined, l.e., analytically defined in L*,a*,b* 
space. We can only say hue will be approximately radially distributed. 

Saturation in L'',a*,b* space is distributed radially from the ori- 
gin of the space. Figure E-5(b) shows a plane of L",a*,b" space (con- 
taining the L'' axis) with lines of constant saturation drawn in. In 
this plane we are approximately fixing hue. The loci of constant satu- 
ration for varying hue are not simple. Clearly saturation will tend 
to vary in cones of increasing opening angle. (Imagine the surfaces 
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(a) Idealised Hue and Satiuation Intervals at L*-50 


(b) Idealized Constant Saturation for a Fixed Hue 


FIGURE E-5. LIGHTNESS, HUE AND SATURATION IN L*,a*,b* SPACE 


2eri 

generated by rotating the lines of Figure E-5(b) about the L* axis.) 

To be more exact than this we would have to translate the empirical 
findings about judgment of saturation, as displayed in Figure E-4, 
into. L" ,a*,b" coordinates. Figure E-5 (a) illustrates contours of 
constant saturation. 

In X,Y,Z space the locus points for which chromaticity (hue and 
saturation simultaneously) is constant is a line radiating from the 
origin of the space. This was explained’ in the above section on X,Y,Z 
space. The transformations involved in passing to L*,a*,b* space pre- 
serve this property. We have discussed where the loci of constant hue 
or constant saturation fall above. The locus of a constant chromaticity, 
hue and saturation considered simultaneously, is still a straight line 
radiating from the origin. The cube root transformation moves these 
lines around — basically it draws more saturated colors in towards the 
diagonal of the space. However, tbe lines of constant chromaticity 
remain straight lines radiating from the origin. 

E.3.3 THE COLOR GUN CUBE AS A COLOR SPACE 

The exposure control system of the Production Film Converter is a 
means of color specification and, hence, the inputs to the system may 
be thought of as coordinates in a color space. The coordinates are the 
three gun numbers we input to the PFC and the resulting color space 
volume we refer to as the color gun cube. In the past, many assumptions 
were implicitly made about properties of this color space which were 
simply false. They were very appealing assumptions to make — it was 
assumed that the space was uniform, that lines of constant chromaticity 
radiated from the origin and that color lightness was determined by the 
sum of gun counts . A sizeable amount of vjork, related to designing 
film products [23524,25] and to analyzing those products [54,55,56] was 

* 1 ' ^ 

The origin of L ,a ,b space is (-16,0,0) to correspond with X = 0, 

Y = 0, Z = 0. 
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based on those implicit assumptions. In this section we review what a 
color science analysis predicts about the PFC color gun color space. 

The PFC color gun color space is a logarithm color space. Whereas 
L*,a'',b'' space used a cube root transformation on transmission, the PFC 
uses a base 10 logarithm. This is the way the PFC was set up to work. 

A set of three look-up tables were built into the PFC, one for each 
primary color gun, which establishes a linear relationship between gun 
counts and the image densities (i.e., the densities of the film to 
transmission of light at the three wavelengths which represent the pri- 
maries of the film, density is the logj^Q of transmission which is the 
percentage of light energy the film lets through) . Note that the loga- 
rithmic transformation is not applied to the axes of X,Y,Z space but to 
the primaries of the PFC, each of which is expressable as a vector in 
X,Y,Z space. 

In applying a logarithmic transformation to a transmission color 
space one does not preserve the property that lines of constant chroma- 
ticity radiate from the origin. Any exponential transformation preserves 
this property but the logarithm does not. The logarithm changes fixed 
ratios to fixed distances. Consider two lines which meet in the origin 
of the original space. Each of these lines represents a fixed ratio 
among the coordinates of the space. If we are talking about a color 
space in which the coordinates measure transmissions, then these are 
two lines of constant chromaticity. If we now apply a logarithmic 
transformation to the coordinates of the space the two lines with dif- 
ferent fixed ratios become two lines with a fixed distance between them, 
in other words, parallel lines. Mathematically, it looks like this. 
Consider two lines in the original space linked with the parameter t: 

r = t (x^,y^,Zj^) 

3^2 = t 

0 ^ t ^ 
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Now apply a logarithmic vector transformation, L, to the space in the 
following manner: 

u = log X 
V = log y 
w = log z 


The two lines are transformed as follows : 

h = == + (log t, log t, log t) 

_x 


where = log x^, etc- Now look at the point-for-point difference 
between the transformed vectors : 

h - ^2 ■ ‘"r"2> ''l-''2’ ”l-’'2> 

We see that it is a constant vector. The vectors and are paral- 
lel. By the same token, all lines in the original space which meet in 
the origin end up parallel to one another. 

The currently used alternative image product, the Kraus Product 
or Product 3, is claimed to maintain consistency in color definition. 

A basic design principle of the product was that data points with the 
same ratios among the three channels should be mapped to colors of the 
same chromaticity (hue and saturation), with different lightnesses. To 
do this is to preserve "angularity" (a word coined by Richard Juday/JSC). 
The Kraus Product attempts to do this by preserving data channel ratios 
in the color gun coordinate ratios. This would work, provided that the 
lines of constant ratio among the color' gun coordinates were indeed 
lines of constant chromaticity. As we have seen, such is not the case. 
Lines of constant chromaticity are parallel. Along a vector out from 
the origin, color saturation does not remain constant. It increases, 
changing from zero (achromatic) to fully saturated. 
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Lightness of colors cannot be predicted by summing the color gun 
counts. For example, using 255 counts on the green gun produces a much 
lighter color than using 255 counts on the blue gun. This is because 
the eye is more sensitive to the wavelengths of light permitted by the 
green primary. A weighted average of the gun counts would do better 
but would probably not be adequate either. From a color theory stand- 
point , the iso-lightness contours in the color gun cube will not be 
planes . 

The perceptual uniformity of the color gun cube to visual color 
difference estimation is problematic. One would expect the logarithmic 
transformation to improve uniformity over a space which measures trans- 
mission. Beyond that, the degree of uniformity is open to question. 

Some slices of the FFC color gun cube have been placed on film. 

The remarks made above about chromaticity and lightness in the color 
gun cube, and its uniformity, are all consistent with observations 
which can be made from the film slices of this color space which we 
have generated. 

E.4 MAPPING TASSELED-CAP VARIABLES TO COLOR 

The way we would choose to place the Tasseled Cap into color 
depends upon what we think are really the independent features we 
want to portray. This comes down to asking whether the features are 
actually rectangular Brightness and Greenness or polar Green Angle 
and Brightness Radius. 

E.4.1 RECTANGULAR COORDINATES 

If we believe that our data space is fundamentally rectangular 
then the most appropriate color space for us to map it into is similar 
to the PFC- color gun cube. The PFC's logarithmic color space separates 
changes in saturation and lightness in a rectangular, rather than radial. 
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manner. If we want lines of constant Greenness to fall onto lines of 
constant saturation (we assume a fixed hue), then we need a rectangu- 
lar color space. 

Investigations of a modification of the PFC color gun cube to 
determine if such a space could be a realistic option in our work have 
been pursued. The modified space, termed L,C,D space, is a linear 
transformation of the cube defined as follows: 

/.ll .57 ,32\ /b\ 

f 0 1/2 -1/2 ] ( G I 

^1/2 -1/4 -1/4 / \r y 

The C coordinate measures a red/green balance while D measures a blue/ 
yellow balance. The L coordinate is an attempt to come as close as 
possible to isolating lightness levels within the cube. Classical color 
theory predicts that iso-lightness levels in the logarithmic space will 
not be planes. Analysis of imagery filmed on the PFC should be per- 
formed to determine how much they deviate from being planar. There is 
reason to’ believe the distortion will actually be less than that apparent 
in the images of L*,a*,b* space we have analyzed. Of course, our version 
of L*,a*,b'' space is based on a model of the PFC with acknowledged inaccu- 
racies, currently being addressed by Juday. 

The essential characteristic of the color gun cube, that saturation 
varies as a rectangular coordinate, is retained under this transforma- 
tion. It would be interesting to map data into this space as an experi- 
ment. The uniformity of the space would have to be assessed empirically, 
as would the success of isolating lightness levels. 
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E.4..2 POLAE COORDINATES 

It appears that the Landsat data space may be characteristically 
radial. Intuitively it seems reasonable that spectral data vectors 
with the same direction but different magnitude represent objects which 
are somehow similar. Within the resolution of the sensor they repre- 
sent identical spectral responses with different overall brightnesses. 
That actually may mean nothing. We have no priori reason to believe 
that the two objects in question are really similar in some sense. The 
modeling work of Reference [57] suggests that, in the case of plant 
canopies, breaking vectors into direction and magnitude is a meaningful 
way of looking at them. It appears that vectors of the same direction 
represent a common stage of plant canopy development while vector magni- 
tude varies with the miderlying soil brightness . Figure E-6 shows model 
results for various degrees of plant cover and three soil brightnesses. 
The radial pattern apparent in this figure is the reason for saying that 
Landsat agricultural data may be characteristically radial. 

We can make vector magnitude and direction the visually independent 
features in our imagery in either of two ways. We could transform to 
polar coordinates , Greenness Angle and Brightness Radius , and proceed 
to map -these variables into a fundamentally rectangular color space, 
such as L,C,D space. We could also map the data space linearly into a 
color space which is fundamentally radial. The latter approach has been 
pursued because we would like to use the L*,a*,b* Uniform Color Space 
and it is a radial color space. Our scheme is to map the apparent data 
origin (not the zero vector) to the origin of color space (L* = -16 , 
a* = 0, b* = 0) and align Brightness with L*. This preserves the radial 
distinction in the data, as indicated in Figure E-7. The Greenness 
direction can be rotated to any hue we find convenient. In the imagery 
currently being processed we have tried placing Greenness along the 
b* axis as well as in the direction of the green primary. 
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SOIL BRIGHTNESS COMPONENT OF REFLECTANCE 


FIGURE E-6. TASSELLED-CAP TRANSFORMATION OF SIMULATED WHEAT 
REFLECTANCES, ILLUSTRATING SOIL BRIGHTNESS EFFECTS 
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APPENDIX F 

SIGNATURE CHARACTERIZATION STUDIES 


This appendix contains a broad description of a method for extract- 
ing multitemporal, multisegment signatures from training data. As of 
now, the method has not been evaluated. Section F.l outlines the method. 
Section F.2 gives some theory on estimation with missing data which 
might lend this method some plausibility. Section F.3 gives the for- 
mula, required by the method, for computing conditional expectations 
when the underlying joint density is a mixture density. 

F.l MULTITEMPORAL, MULTISEGMENT SIGNATURES 

A method is outlined for extracting signatures from multitemporal, 
multisegment training data. We suppose that for a fixed object class, 
say wheat, there is data from T passes with T 20. For each of the 
S segments, S 5 j 40, we have data for about four of the passes. If we 
think of feature space as- being L, L « 2, spectral features over all 
T passes, then the data is incomplete in that we observe about 4 x L 
components out of a total of approximately p = L • T s;40. We think 
of each data vector as being of length L • T with real values for cer- 
tain components and blanks for the others. The number of incomplete 
data vectors from each segment is approximately 50 (e.g., 50 wheat blobs). 

Thus the data set may be characterized as follows: We have about 

2000, incomplete data vectors of length 20L. The missing components are 
approximately the same within each segment, but may vary from segment . 
to segment. The number of missing components is about 80% of the data 
vector length. 

We would like to obtain a p-variate density function which charac- 
terizes the set of the training data vectors. A reasonable form for the 
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density is a convex mixture of p-variate Gaussian densities, i.e. , a 
density f of the form 


r 

f= I 

i=l 


r 

where A. :s: 0, 7 X. = 1, and f. are p-variate Gaussian densities; with 

x=l 

r, the and the parameters of the f^ unknown. Rassbach et al [7] 
have developed a procedure for doing this when the data vectors are com- 
plete. The computer program implementation is called CLASSY. 

We describe an iterative procedure for extending CLASSY to the case 
of incomplete data vectors. A flow chart of the procedure is given in 
Figure F-1. The first step is to fill in the missing data values. To 
do this we use the method of profiles (see Section 4.4 and Appendix A). 
Loosely speaking, this method fits a curve based on empirical data to 
the observed data values and permits extrapolation and interpolation. 

The next step is to employ CLASSY on the completed data vectors to obtain 
a signature density function 

f = y X.f. 

^ XX 


Next is a decision as to whether to stop the iteration. The cri- 
terion for stopping might be a fixed number of iterations or might be 
based on the change in f after successive iterations. 

If the iteration continues, the next step is to fill in the incom- 
plete data vectors using the method of conditional expectation relative 
to f . By that we mean the following. Let Z be a random variable having 
density f. Let V be the random variable determined by a particular 
(observed) combination of components of Z, and Y the random variable 
determined by the remaining (not observed) components of Z. Then we 
have the conditional expectation E^t^(v) of Y given V = v. Now for a 
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FIGURE F"l. FLOW CHART FOR SIGNATURE EXTRACTION PROCEDURE 
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specific incomplete data vector z^, let denote the vector of observed 

components. Let the random variable V be associated with this set of 
components and let the random variable Y be associated with the remain- 
ing components. Then the missing components are filled in with the 
values 

It would be interesting to try this procedure on a small scale 
using one spectral feature, say Greenness, and a limited number of 
passes, say 10. The following remarks may be helpful: 

1. Our concept is that the data vectors are actually blob means. 

2. The signatures will be both wheat and non-wheat data sepa- 
rately clustered. 

3. There does not appear to be any conceptual difficulty in 
incorporating collateral features into this signature extrac- 
tion procedure. 

4. The problem of estimation with incomplete data has been con- 
sidered by Boullion |3]J et al. They consider the case of a 
sample from a multivariate Gaussian. 

5 . There is concern that the covariance matrix obtained after 
the first step might be ill-conditioned. It may be necessary 
to add a diagonal matrix to it to overcome this difficulty. 

6 . An excellent presentation of conditional expectation for 
Gaussian distributions is in Anderson [5$]; note especially 
Theorem 2.5.1, p. 29. 
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F.2 ESTIMATION WITH INCOMPLETE DATA 

We supply here some background to lend plausibility to the procedure 
described above. Let X denote a data matrix of dimensions p x N with 
the columns representing the data vectors. We suppose that some of the 
entries of X are missing. For each missing entry we substitute a real 
variable. If r of the entries of X are missing, then we denote the vector 
of variables representing them by 



Thus X is a matrix valued function of the vector x, 

X = X(x). 

Suppose that the columns of X are observations from a p-variate 
Gaussian distribution with unknown mean vector u and unknown covariance 
matrix M. We want to find the maximum likelihood estimate of the 
triplet XjPjM, 

Let p (x) denote average column vector of X and let A = A(x) denote 
the matrix obtained from X by subtracting ll(x) from each column of X. Set 

M(x) = ^ A(x)A^(x) 

and le t 

\ 

1m(x) I 

denote the determinant of the matrix M(x). Then the following theorem 
may be proved by modifying the proof of Theorem 3.2.1 in Anderson [ ]. 
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Theorem 1: (x*,p*,M*) is a maximum likelihood estimate iff 


1 X is optimal for 
min. [m(x) I 

X 

ii y* = 1 j(x*) 

iii M* = M(x*) 


(F-1) 


In order to describe an Iterative procedure for optimizing (F-1) , 
we require some notation. Each column of X contains observed components 
and missing (not observed) components. If the mean y and covariance 
matrix M were known, we could fill in the missing components with their 
conditional expectations given the values of the observed components. 

If we do this for each column of X, we will have filled in all the 
components of x. We call the values of x obtained via this process 
the conditional expectation of x given the observed entries X ^ of X 
relative to y and M and denote it by 


E[x X^;y,M] 


For convenience we define g(x) by 


g(x) = E[x X^;y(x),M(x)] 


( 0 ) 


Consider the following Iterative procedure. Choose x arbitrarily 


. gCx»>) 


. g(x<») 


X 


(n) 


g(x 


(n-1) 


) 
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It can be shoxorn that 


with equality iff 


^(n) . ^(n-1) 


If x^’^^ converges to x*, then 

X* = g(x*) 

In fact, the limit of any convergent subsequence of the x^^^ will be a 
solution of the equation x = g(x). If (x*,v*,M*) is a maximum likelihood 
estimate, then 


x"' = g(x^), u* = y(x*), M’' = M(x'') 


From these conclusions, we obtain the following theorem. 


Theorem 2: If there is unique x for which 

X* = g(x*) 

then x*',vT(x*) ,M(x*) is the unique maximum likelihood estimate and the x^^^ 
in the iterative procedure will converge to x , 

The following question has not been answered: What condition does 

X(x) have to satisfy to insure a unique solution to the equation x = g(x)? 


F.3 COMPUTATION OF CONDITIONAL EXPECTATION 

At each step in the iteration of the procedure of Section F.l, the 
CLASSY stage provides an estimate of the signature density f which is a 
convex mixture of Gaussian densities, i.e., 

n 


• l6l 




where ^ 0, ^ X. = 1, and the are Gaussian densities. We give here 
‘ i 

the, formula required for computing the conditional expectation of the 
missing variables given the observed variables relative to f. Let 
Z ^ ^ random variable with density f (Y represents the missing 

components of a particular column of the data matrix and V represents 
the observed components), and let 



be a random variable with density f^. We want to relate the conditional 
■ expectation 

E(y|v = v) = h(v) 
to the conditional expectations 

E(Y.|V. = v) 5 h. (v) 

To this end, let 

f^(v) = j f(y,v)dy 

denote 


denote 


But 


the marginal distribution of V and let 

fy. (v) 1 g i g n 

i 


the marginal distribution of V^. Then 


h(v) = 


f^(v) ^ 
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and 


Let 


Then 


y f^(y,v)dy 


yf(y,v)dy = 

r yr . 

■ ' J (V) 


yf,. (y,v)dy 
i 




X^fv (V) 
i 

i 


h(v) = In^h^Cv) 


Thus h(v) can be obtained from the h^(v) and the marginals (v). 
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APPENDIX G 

SPRING WHEAT/BARLEY LABELER 


G.l BACKGROUND 

The problem of separating spring wheat from other spring small 
grains, of which barley if by far the most prevalent, has emerged as 
one of the key problems in obtaining accurate estimates of spring 
wheat production [ 1 ]. In response to that need, research was begun 
at ERIM to devise a spring wheat /barley labeling technique. This work 
resulted in a first generation machine labeler described in Section G.2 
below and in Reference 3 . Evaluation of the deficiencies found in the 
first generation led to an approach and methodology for a refined 
labeler based upon a more sophisticated utilization of agronomic under- 
standing and Landsat information content. Progress achieved in this 
refinement is presented in subsequent sections of this Appendix. 

G.2 FIRST-GENERATION LABELER 

G.2.1 DESCRIPTION 

The machine technique assumes prior analyst-interpreter involve- 
ment of two types : labeling of spring small grains targets and determi- 

nation of suitability of the segment for the machine technique. A seg- 
ment's suitability is determined by the number and timing of acquisi- 
tions in the growing season for spring small grains , as will be des- 
cribed later. 

Data passed to the machine are assumed to be normalized by a 
series of preprocessing steps comprised of satellite calibration cor- 
rection, cosine sun angle correction, flagging of bad data, clouds, 
etc. [36], and haze correction [3?]* 
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Once targets have been identified as spring small grains and ac- 
quisition histories have been checked, the machine' carries out the 
the spring wheat/barley separation in two steps: estimation of crop 

calendar shift and assignment of labels. 

1. Estimation of Crop Calendar Shift : The first step in the 

machine process involves a technique by which signal variability due 

to differences in stage of development (as expressed in spectral appear- 
ance) is minimized. This technique, described in Reference 3 , also 
allows more accurate selection of critical acquisitions for the label 
assignment step. 

Crop calendar shift estimation utilizes a mathematical repre- 
'sentation (profile) of Tasseled-Cap Greenness development with a fixed 
time axis. Data for a particular target are shifted along the time 
axis until the best fit of data to the reference profile is obtained. 

The fixed time axis is then used as the new time axis for the target 
observations, and the difference between the original acquisition day 
axis and this fixed axis is the estimated crop calendar shift. The 
process requires that at least three acquisitions, 18 or more days 
apart, be available in the course of the spring small grains develop- 
ment cycle (approximately 90 days in length) . 

/ 

2. Label Assignment : Those targets for which a crop calendar 

shift were successfully estimated are passed on to the second phase of 
the process — label assignment.- The labeling logic is designed to ex- 
ploit observed' differences in the temporal-spectral development of 
spring wheat and barley in Tasseled-Cap Brightness-Greenness space. 
Figures G-1 and G-2 illustrate examples of spectral trajectories for 
the two crops. As the crops green up, they move along a "green arm" 
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FIGURE G-1. EXAMPLE SPRING WHEAT TRAJECTORY - 
SMOOTHED CURVE OF MEANS 
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to some maximum Greenness. Then they typically loop back toward the 
Greenness axis and show a reduction in Greenness accompanied by in- 
creasing Brightness. This movement away from the green arm corresponds 
to the ripening of the crops. It is during the course of ripening, and 
particularly around the dough stage of wheat development (around 11.2 
on the modified Feekes scale) , that barley tends to be found farther 
from the green arm than spring wheat. 

A reference line was defined which is perpendicular to the 
observed path of the two crops away from the green arm, and the dis- 
tance from this reference line is used to discriminate between the 
two crops. Reference 3 gives further details. In order to b'e 
labeled, targets must have an acquisition in an 18-day range between 
shifted days 186 and 203. 

G.2,2 TEST RESULTS 

The first generation machine labeler was tested on 28 LACIE blind 
sites from the Northern Great Plains spanning two crop years (Phases 2 
and 3). The results, described in detail previously [ 3 ], indicated 
that the technique worked well in the area encompassing the -segments 
used for development of the logic, but poorly in areas separated by 
some distance from those segments. This’ stratification is illustrated 
in Figure G-3. 

Two phenomena were consistently observed in the segments for which 
poor results were obtained. First, the errors were always errors of 
omission for spring wheat, l.e. , most of the spring wheat targets were 
mislabeled as barley. In fact, both crops had distance measures sig- 
nificantly greater than expected on any given day in the time period 
of expected separability. Second, the segments all fell in areas 
likely to experience drought conditions. 
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-^Training Site 


FIGURE G-3. ORIGINAL LABELER TEST RESULTS - DECEMBER 1978 



As a result of these tests, an effort was undertaken to revise the 
labeler so as to increase its geographic range of effectiveness. 

g.3 approach to refinement 

G.3,1 philosophy' 

Although the entities to which labels are assigned are picture 
elements or groups of picture elements, the actual entities of interest 
are the populations of plants whose spectral properties are represented 
in the pixels. If any labeling technique is to be successful, it must 
be the case that the primary Influences on pixel spectral values are the 
populations of plants themselves. When external effects such as sun 
angle and haze are eliminated or minimized, we must assume that .bio- 
logical factors are the main spectral drivers. Given this assumption, 
it is clear that labeling techniques must be firmly grounded in bio- 
logical and physical phenomena, in the behavior of and influences on 
the plant populations themselves. 

G.3. 2 STEPS 

In order to insure that the results of the labeling technique 
revision were biologically sound, a sequence of steps was followed, 
starting with the plants themselves and working out towards the 
sensor. 

1. Develop Hypotheses : In order to provide an initial focus 

for the research, a set of hypotheses must be formulated 
to answer the question of interest. These represent not 
a set of absolute constraints, but rather a set of poten- 
tially useful directions to pursue. 
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2. Identify Pertinent Physiological Relationships, Effects ; 

Using results of previous agronomic research, crop charac- 
teristics or responses to given conditions which are likely 
to affect spectral response can be identified. These char- 
acteristics and responses on the level of individual plants 
or small groups of plants are the driving factors in deter- 
mining the spectral characteristics of the plant population 
as represented in the picture elements . 

3. Model Canopy Reflectance ; Utilization of canopy reflec- 
tance models allows the spectral effects of relevant physio- 
logical or environmental changes to be predicted and studied. 
Models allow a stricter control, and at the same time a 
broader range of conditions than might be available in 
field data. 

4. Analyze Field Measurements : Physiological relationships and 

their spectral effects can be verified through the use of 
field reflectance data. These data provide the crucial link 
between simulation and the real world, and also between the 
ground and the satellite. Field data also aid in under- 
standing the range of natural variability of spectral appear- 
ance and influential biological and physical factors. 

5. Analyze Landsat Data ; Only after the previous steps have been 
completed does satellite data enter into the process. At this 
point, hypotheses have been supported or refuted at several 
levels, and the results expected in the Landsat data are 
already well understood. Thus, this step' is a means of 
confirming expectations rather than one of finding a way 

to do the task at hand. The way has already been defined. 

The Landsat data will only confirm its reliability or indi- 
cate a need to return to an earlier stage in the process for 
additional work. 
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G.4 RESULTS OF REFINEMENT 

G.4.1 HYPOTHESES 

Based on the distinct geographic separation of "good" and "bad" 
results (Figure G-3) , and the correlation of "bad" results with high 
drought-susceptibility areas, it was likely that moisture stress was 
altering crop signatures and thereby causing spring wheat to look 
like barley. The geographical separation could also suggest dif- 
ferences in soil brightness. One of these factors, or a combination 
of them, were thought likely to be responsible for the labeling prob- 
lems encountered. 

G.4.2 PHYSIOLOGICAL RELATIONSHIPS AND EFFECTS 

Since soil brightness per se has no physiological effect on the 
plants, and associated soil properties would primarily be reflected 
in moisture availability, only moisture stress effects were studied 
in this stage of the labeler revision effort. 

The clear indication from a fairly wide range of research related 
to the impact of inadequate moisture on small grains is that effects 
vary considerably as a function of the stage of crop development at 
the time of stress initiation and the duration of the stress . In 
areas such as those where poor labeler results were obtained, prolonged 
moisture stress is likely. The effects of prolonged stress on small 
grains plants may include reduction in plant height and number of 
tillers per plant, thinner and smaller .leaves which may roll or wilt,, 
particularly in the middle portion of the day, reduction in the number 
of plants per acre, and increased rate of plant development. These 
effects stem from two major factors: a need to reduce surface area 

from which moisture may be lost through evapo transpiration and re- 
duced survival rates for the plant populations. 
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G . 4 . 3 REFLECTANCE MODELING 

Approach ; Reflectance modeling results were obtained using a 
three-layer version of the Suits' canopy reflectance model [21,59]. 

A set of "normal" parameters based in field measurements of wheat 
plants were modified to simulate the physiological effects of moisture 
stress. These modifications included changes in leaf orientation, 
leaf color, plant height and density, and relative depths and densi- 
ties of canopy layers. Parameters were determined for seven stages 
of development for both the normal and stressed conditions, as indi- 
cated in Table G-1. Both canopies at all stages of development were 
combined with three different soil spectra corresponding to dark, 
medium, and bright soils [60]. Landsat band reflectances were computed 
utilizing sensor spectral response functions , and these reflectance 
data were then transformed into a Tasseled-Cap-like projection. 

Results ; The trends apparent in the model data indicate signifi- 
cant and distinguishable effects of moisture stress and soil Brightness 
on canopy reflectance. As illustrated for normal canopies in Figures 
G-4 and G-5, soil Brightness exerts little if any significant influ- 
ence on Greenness, but considerable influence, as expected, on Tasseled- 
Cap Brightness. The effect on Brightness is most pronounced at the 
tails of the profile, where canopy closure is at a minimum, resulting 
in maximum soil exposure. Indeed, soil effects are all but eliminated 
when canopy closure is at a maximum. 

Figure G-6 illustrates for the normal canopy the distance measure 
used in labeling, plotted over time. The descending portion corre- 
sponds to a change from bare soil to vegetated soil, while the flat 
middle portion corresponds to the greening-up of the crop (movement 
along the green arm) . The later ascending portion represents the 
ripening of the crop, and includes the time period in which spring 
wheat and barley are most distinguishable (Section G.2.1). 
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TABLE G-1. STAGES OF DEVELOPMENT MODELED IN CANOPY 
REFLECTANCE ANALYSIS 


Stage # 

Name 

Approx. 
Day of Year 

Characteristics 

1 

Emergent 

131 

Detectable Greenness 

2 

Jointing 

158 

Stem elongation occurring, 
considerable green vege- 
tation 

3 

Boot 

167 

Just prior to heading, 
increasing component of 
mature green parts 

4 

Post-head 

185 

Heads present, mature green 
component dominant , lower 
leaves dead, entire plant 
not yet turning 

5 

Senescing 

194 

Entire plant turning 
yellow/brown 

6 

Ripe 

203 

No green matter remaining, 
plants dead 

7 

Harvested 

221 

Stubble only 
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FIGURE G-4. CANOPY REFLECTANCE MODELING RESULTS, 
SOIL BRIGHTNESS EFFECTS ON GREENNESS - 
NORMAL CANOPY 
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FIGURE G-5. CANOPY REFLECTANCE MODELING RESULTS 
SOIL BRIGHTNESS EFFECTS ON BRIGHTNESS - 
• NORMAL CANOPY 




FIGURE G-6. -CANOPY REFLECTANCE MODELING RESULTS, 
SOIL BRIGHTNESS EFFECTS ON DISTANCE MEASURE - 
NORMAL CANOPY 
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Since the distance measure is a linear combination of Greenness 
and Brightness, changes in those two components as a result of soil 
brightness variation or any other factor will be reflected in changes 
in the distance profile. Thus, soil brightness effects on the distance 
measure are similar to those on Brightness , occurring primarily at the 
times when canopy closure is low. The primary effects of bright soil 
seem to be an increased slope in the non-level portions of the distance 
profile and a positive offset of all distance values. 

While Greenness is little affected by soil brightness, it is sig- 
nificantly impacted by moisture stress, as illustrated in Figure G-7. 

A considerable effect was also seen in Brightness, as demonstrated by 
comparison of Figure G-8 to Figure G-5; but here the effect is primarily 
an enhancement of soil influences resulting from reduced canopy closure. 
As a result, the impact of moisture stress on Brightness cannot be pre- 
dicted without knowledge of the soil brightness. 

Again, the effect of moisture stress on the distance profile is 
a combination of Greenness and Brightness effects, as illustrated by 
comparison of Figure G-9 to Figure G-6. As in the Brightness profile, 
soil brightness effects are enhanced with the more open stressed canopy. 
The overall reduction in Greenness values is translated into a general 
increase in the distance measure, and a reduction or near elimination 
of the level middle portion of the profile. Of greatest probable sig- 
nificance to labeling is the fact that the ascending portion of the 
distance profile, the portion containing the time period of maximum 
spring wheat/barley separability, comes sooner than it does for the 
normal canopy . 
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FIGURE G-8. CANOPY REFLECTANCE MODELING RESULTS, 
SOIL BRIGHTNESS EFFECTS ON BRIGHTNESS - 
STRESSED CANOPY' 
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FIGURE G-9. CANOPY REFLECTANCE MODELING RESULTS, 
SOIL BRIGHTNESS EFFECTS ON DISTANCE MEASURE - 
STRESSED CANOPY 




Summary : The model data thus suggest that both moisture stress 

and soil brightness exert detectable influences on canopy reflectance 
and, perhaps most importantly, on the distance profile in the time 
period of separability. Neither hypothesis can be rejected as a 
possible cause for the experimental results obtained previously. 

Finally, the two factors should be distinguishable and detectable 
by separate processes. Moisture stress should be expressed in a re- 
duced Greenness peak, while soil brightness should be expressed in 
early season Brightness values. 

G.4.4 FIELD MEASUREMENTS 

In this study, field measurements were used primarily in a support 
role. Field data were used to provide normal canopy inputs for the 
reflectance model. In addition, on-site observations of field experi- 
ments being conducted under the direction of Dr. Ray Jackson at the 
U.S. Wafer Conservation Laboratory, Phoenix, Arizona, served to substan— 
tiate the physiological effects of moisture stress reported by agronomic 
researchers. These on-site observations thus aided in the understanding 
of the physiological phenomena involved, and their spectral impacts. 

G.4.5 LANDSAT DATA ANALYSIS 

Data : The data used in verifying the expected impact of moisture 

stress and soil brightness variations and revising the labeler to 
accommodate those impacts consisted of observations of small grains 
targets from seven LACIE blind sites, a subset of the original test 
set (see Section 1.2). Table G-2 provides additional information on 
the segments, and Figure G-10 shows their locations. 
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TABLE G-2. 

Segment 

1498 

1515 

1640 

1663 

1669 

1800 

1929 


SEGMENTS USED IN LANDSAT DATA ANALYSIS 
Location 

Codington County, South Dakota 
Norman County, Minnesota 
Barnes County, North Dakota 
Richland County, North Dakota 
Perkins County, South Dakota 
McCook County, South Dakota 
Blaine County, Montana 
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PIGDRE G-10. SEGMENTS USED IN LANDSAT DATA ANALYSIS 



Expected Results : Based on the previous steps in the research 

process, the following relationships were expected to be observed in 
the Landsat data. 

Moisture stress of a prolonged nature should be detectable in a 
reduced peak in the Greenness profile for small grains in a segment. 

As a result of the impact of moisture stress on the distance profile, 
the time period during which spring wheat and barley are distinguishable 
should occur earlier than normal. 

Soil brightness variations should be manifested in early season 
Tasseled-Cap brightness values (i.e., before significant growth of 
green vegetation has occurred). An increase in the slope of the dis- 
tance profile during the time period of separability should also be 
observed and as a result a more steeply-sloped decision line would be 
expected. In addition, moisture stress and bright soils should interact 
by enhancing the soil effects, resulting in higher distance values 
during the time period of separability. 

■ Test Results ; Two iterations of data analysis were carrie'd out, 
intended to provide a rough-cut revision of the original labeling logic, 
followed by a more precise revision. 

The first iteration utilized data from six segments. Crop calendar 
shift was estimated using a two-step refinement of the original tech- 
nique. After a shift estimate was obtained using a standard profile, 
a new profile was computed using all small grains pixels (shifted) in 
the segment. This segment-specific profile was then used to refine 
the first shift. While the difference between data shifted with the 
segment-specific profile and data shifted with the standard profile 
is far less dramatic than that between shifted and unshifted data, the 
segment-specific profile shift did tend to produce smoother distribu- 
tions over time. In addition, fitting a profile to segment data pro- 
vided a means of obtaining a segment-level peak Greenness estimate. 
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The apparent time period for separability of wheat from barley 
(using the Greenness-Brightness distance) was determined through a 
combination of quantitative and qualitative methods. Optimum linear 
discriminants were computed for each 3-day interval from peak Green- 
ness through a time at or near harvest. These results were then -com- 
bined with visual evaluation of the distance measure plotted against 
shifted day of year, and used to produce estimates of the day range of 
separability. Optimum linear discriminants were again computed, this 
time for the defined day range as well as for an 18-day interval com- 
prising a subset or expansion of the defined range. 

Figures G-11 through G-14 illustrate the results of this first 
iteration, and indicate support for the hypothesized relationships. 
Segments with low green peaks (1669 and 1929) have earlier day ranges 
of separability. These same segments also had very bright soil, and 
this was reflected 'both in the starting distance value for the linear 
discriminant and the slope of the decision line. 

Based on the sparse data available, a preliminary labeling logic 
revision was devised, and tested on the six training segments. Table 
G-3 illustrates the ‘labeling steps used, while Table G-4 and Figure G-15 
show the results. Noteable in the results is the -fact that labeling 
performance on problem segments was distinctly .improved, while labeling 
performance on good segments was not degraded. 

The second iteration of Landsat data analysis was intended to 
bring in a significantly larger data base from which relationships 
could be more precisely defined. Data processing and availability 
problems precluded this approach, however, and only one additional 
segment was added. 

Nevertheless, the data' from the seven segments were reprocessed, 
this time using the new profile model (Appendix A) .in the segment- 
specific shifting procedure. In addition, linear estimates of the 
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FIGURE G-11. OPTIMUM DECISION LINES - ITERATION 
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FIGURE G-12. CORRELATION OF ESTIMATED PEAK GREENNESS WITH FIRST DAY 

OF SEPARABILITY - 1st ITERATION 
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FIGURE G-13. CORRELATION OF ESTIMATED SOIL BRIGHTNESS WITH 
DECISION LINE SLOPE - 1st ITERATION 
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FIGURE G-14. CORRELATION OF ESTIMATED SOIL BRIGHTNESS WITH STARTING 
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Terim 


TABLE G-3. STEPS USED IN APPLICATION OF PRELIMINARY 

LABELING LOGIC 


Data Set: Small grains pixels (identified from 

ground truth tapes). Data correction and 
normalization, and Tasseled-Cap transfor- 
mation applied. 


Step Description 

1 Divide segments into two bins based on estimated 
peak Greenness 

2 Utilize linear regression estimate of soil 
brightness/decision line slope relationship 
to estimate decision line slope. Default 
slope is .53 (if no soil brightness data 
available) 

3 Compute decision lines : 

low peak bln - 

starting day =172 

' y-intercept = 37.25 - slope * 170.1 
(indicates standard "pivot point" 
from which slope can be calculated) 

high peak bin - 

starting day = 186 

y-intercept = 37.25 - slope * 191.1 

4 Assign labels 
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TABLE G-4. . DETAILED LABELING RESULTS OF LANDSAT DATA ANALYSIS - 1®^ ITERATION 





Original Labeling 

Optimum Linear 

Revised Labeling 



Procedure 

Discriminant 

Procedure 

(Prelira. ) 

Segment 

Ground Truth 

SPW/OSMG 

% .Correct 

SPW/OSMG ; 

'i Correct 

SPW/OSMG 

% Correct 

1498 

Spring Wheat 

806/239 

77.1 

846/220 

79.4 

802/264 

75.2 

Barley 

114/310 

73.1 

134/283 

67.9 

116/301 

72.2 

1515 

Spring Wheat 

2597/449 

85.3 

2805/551 

83.6 

2795/561 

83.3 

Barley 

421/2033 

82.8 

737/2022 

73.3 

801/1958 

71.0 

1640 

Spring Wheat 

4593/971 

82.5 

4493/1077 

80.7 

4601/969 

82.6 

Barley 

909/1363 

60.0 

770/1497 

66 . 0 

822/1445 

63.7 

1669 

Spring Wheat 

11/384 

2.8 

231/51 

81.9 

242/40 

85.8 

Barley 

10/261 

96.3 

75/102 

57.6 

81/96 

54.2 

1800 . 

Spring Wheat 

41/34 

54.7 

55/20 

73.3 

44/31 

58.7 

Barley 

133/871 

86.8 

315/679 

68.3 

278/716 

72.0 

1929 

Spring Wheat 

4/1346 

0.3 

^ 678/312 

68.5 

662/328 

66.9 

Barley 

1/545 

99.8 

147/232 

61.2 

139/240 

63.3 





new model parameters were obtained for each of the small grains pixels 
in the segments' (about 33000 total) . These pixel-specific profiles 
provided a means of eliminating pixels with distinctly non-small-grain- 
like trajectories (ground truth discrepancies, etc.) and so resulted 
in a purer data base of small grains observations. About 4000 pixels 
were eliminated by this process. In addition, the pixel-specific para- 
meter estimates made possible, an analysis, not yet completed, of pixel 
spectral behavior as a function of estimated Greenness peak. 

As in the first iteration, day ranges of separability were defined 
for good small grains pixels, using -quantitative and qualitative analysis 
techniques , and optimum linear discriminants were computed in those 
ranges and in corresponding 18-day intervals. 

These decision lines are illustrated in Figure G-16, and show 
much the same trends as did the first set. The particular trends of 
interest are illustrated in Figures G-17 through G-19. 

The one discrepancy between the two iterations involves the soil 
brightness/decision line slope relationship. While clear in the first 
iteration, this relationship is not apparent in the second. Small 
changes in slope had little impact on labeling accuracies in either 
case, suggesting that this particular element of the labeling logic 
is less important than, for example, the time period in which sepa- 
rability occurs. Nonetheless model results and our understanding of 
the physical processes involved suggest that the relationship should 
be a factor, and further work aimed at more' conclusively supporting 
or rejecting the hypothesis is desirable. 

g.4.6 procedural specification of refined labeler 

Although precise tuning of the labeling logic was not possible 
due to data problems, the procedural steps in the refined small .grains 
labeler have been defined, at least in a preliminary context. The 
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FIGURE G-16, OPTIMUM DECISION LINES - ITERATION 
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FIGURE G-17. CORRELATION OF ESTIMATED PEAK GREENNESS WITH FIRST, DAY 

OF SEPARABILITY - 2nd ITERATION 
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FIGURE G-18. CORRELATION OF ESTIMATED SOIL BRIGHTNESS WITH 
DECISION LINE SLOPE - 2nd ITERATION 
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CORRELATION OF ESTIMATED SOIL BRIGHTNESS WITH STARTING 
DISTANCE VALUE - 2^^ ITERATION 
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functions resulting from analysis of a more extensive data set would 
almost certainly' vary from those here defined, but the forms of the 
preliminary functions do have strong support in physical understanding. 
Thus while the labeling logic as specified may fail to do justice to 
the concepts on which it is based, it does provide a benchmark, and a 
means by which the underlying concepts may be both understood and in- 
corporated into test and evaluation exercises. 

The refined labeling logic allows for three modifications to the 
orientation of the decision line to accommodate moisture stress and 
soil .brightness effects. Of these three, two have been specified. 

The third, modification of the decision line slope in response to soil 
brightness variations, could not be supported with the available data. 

Thus in this procedural specification, a standard slope of O'. 61 is used. 
This is the, slope of the decision line in the original labeler, arid 
represents something of a median value for slopes computed in the 
second iteration of Landsat data analysis. 

The relationship between estimated peak Greenness and the start 
of separability is defined as Illustrated in Figure G-20. 

The data points support but do not prove the validity of the des- 
cribed function. There is, however, physical basis for the relation- 
ship depicted. Above a Greenness peak representing the low end of 
"normal" moisture conditions one would expect to find crops of increasing 
leaf area or, qualitatively, "lushness", perhaps as a result of increasing 
abundance of moisture, (up to the point at which the over-abundance of 
moisture itself became stressful). It would not be expected, however, 
that these crops would exhibit a pronounced lengthening of the growing 
cycle, since climatic constraints quickly come into play. Normal and 
above-normal crops should develop at the same rate, and so have the 
same time period for separability. 
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FIGURE G-20. FUNCTION USED TO PREDICT STARTING DAY OF SEPARABILITY 

FROM ESTIMATED PEAK GREENNESS 



The impact of moisture conditions on start of separability is felt 
at below-normal moisture levels where stress triggers an increase in 
rate of development- This is reflected in the sloping portion of the 
function. There must also, however, be some limit on how much develop- 
ment can be speeded up, or how much stress can be tolerated. This is 
reflected in the vertical portion of the function at extremely low peak 
values. 

The relationship between soil brightness and the initial distance 
value of the decision line is defined as shown in Figure G-21. Modeling 
results indicated that this relationship only comes into play in stressed 
(more open) canopies. A peak value of 52.5 is defined as being the 
minimum peak of a normal canopy. For segments with a lower peak esti- 
mate than 52.5, the function is utilized. 

Again, support can be found both in the data and in physical under- 
standing, but precise definition with the sparse data was not possible. 
The vertical portion of the function reflects the fact that some limit 
is imposed on how small the distance value can become. Since label 
assignment takes place after the crops have begun to move off the green 
arm, they must be at a distance from the reference line greater than 
that of the green arm itself. Even a very dark soil would not be 
expected to shift the data past the green arm. The actual value of 
the vertical portion (Distance = 33.0) is simply based on the apparent 
clustering of the darker-soil segments around this value. 

Given these functional relationships, the procedure for labeling 
spring small grains is as follows: 

1. Distinguish spring small grains from non-small grains by some 
unspecified technique. 
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FIGURE G-21. FUNCTION USED TO PREDICT STARTING DISTANCE VALUE FOR DECISION LINE 

FROM ESTIMATED SOIL BRIGHTNESS 
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2 . If not already done, apply data normalization techniques 
comprised of : 

a. satellite calibration 

b . cosine sun angle correction 

-c. bad data, cloud, etc. detection (ERIM' s -SCREEN) 
d. haze correction (ERIM's spatially-varying XSTAR) 

3. Carry out manual screening of segments: 

a. Acquisition history requirements: 

1) At least three acquisitions (at least 18 days 
apart) in the spring small grains growing 
season (typically May through July) 

2) One acquisition around the time of turning 
(typically mid- July) 

b. Presence of unusual phenomena - clearly anomalous 
conditions such as wide-spread abandonment, pre- 
maturity harvesting, etc. 

c. If acquisition history is inadequate or unusual 
phenomena are observed, spring wheat/barley labeling 
is not attempted. 


4. Estimate crop calendar shift using the two-stage segment- 
specific technique: 

a. Choose that shift which maximizes the cross- 
correlation term: 


R = 


1 + 




1+T 






i+T^i^ 


2 - 


where = reference profile function value 

G. = data value 
a. 

X = shift value 
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b. Make a first estimate using a reference profile 
of the form: 

F(t) = at^e*^^ 

where F(t) = Greenness - 25. 

t = shifted day of year - 125. 
a = 0.65163 
h = 1.2957 
c = -0.52415 X lO"^ 

The profile is calculated f.or offset days 1 
through 120. In addition, a 30-day tail is added 
to the early side of the profile, with a constant 
value equal to the value at offset day 1. 


c. Make a second estimate using a profile of the form: 


F(t) 


ae 


h (t-t 

1 p 


t < t 

P 


ae 


b^Ct-tp)' 


t >L t 

P 


where F(t) = Greenness - 25. 

t = shifted day of year 

t = shifted day of peak = 160. 
P 

a,b^,b 2 = parameters 


The parameters a, b^^, and b^ are estimated for 
each segment using linear regression on small-grain 
blob means after the first shift has been applied; 
note that 

a + 25. = estimated peak Greenness 

This parameter reflects the use of data that have been Tasseled-Cap 
transformed and to which an offset of 32 counts has been added to 
eliminate negative numbers. 
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5, Compute segment-level mean Brightness in shifted days 110-120 
for all available small grains pixels (this is soil Brightness 
diagnostic) . 

6, Modify the decision line, based on estimated segment-level 
peak Greenness and soil Brightness , in the manner specified 
in Figures G-20 and G-21. 

Default values are: - 

a. Slope = 0,61 

b. Starting Day = 186 

c. Starting Distance = 33. 

7, Assign labels: barley if above line, wheat if below. 
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APPENDIX H 

DESCRIPTION OF DEVELOPMENT OF THE LANDSAT 3 TO LANDSAT, 2 
CALIBRATION -CORRECTION 

In order to carry out analysis procedures onLandsat data in such 
a way that the procedures act in a consistent manner regardless of which 
Landsat collects the data, a normalizing transformation must be carried 
out so that the' calibration of counts to received signal level is con- 
sistent. At ERIM, work was previously completed and reported [38] for 
adjusting the calibration of Landsat 1 and 2 data to a standard. The 
standard was that calibration used for Landsat 2 segment data used with- 
in LACIE during its first three years-. This section discusses the work 
recently completed to determine and test a similar normalizing transform- 
mation for Landsat 3 data. 

The criterion used as a goal in developing the transformation was 
that' the behavior of algorithms SCREEN and XSTAR [36, 37] on the resulting 
transformed data must be equivalent to their behavior on Landsat 2 data. 
For this to happen, the processed Landsat 3 data would need to exhibit 
a stable location in spectral space for the overall pattern of scene 
pixels (e.g., a stable Tasseled Cap [12]), -and the location of the pat- 
tern would be the same as for Landsat 2 data, 

H.l DEVELOPMENT , - ■ 

The first step in developing the transform was to prepare a" data 
set consisting of acquisition pairs in which one of each pair is Landsat 
2 data, and the other is Landsat 3 data over the same site, exactly nine 
days earlier or later (since the satellite orbits are nine days apart, 
and consistent position within a full frame was desired) . 
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The available data consisted of Landsat acquisitions, obtained from 
Landsats 2 and 3 throughout the 1978 spring wheat growing season, over 
13 5x6-miles sites in the Northern U.S. Great Plains. From this base, 

31 acquisition pairs were prepared. Care was taken to avoid acquisitions 
with a significant amount of clouds or water, and to avoid pairs for 
which the haze levels or scene content were extreme. 

All acquisitions in the 31 pairs were processed using algorithms 
SCREEN and PFEAT. SCREEN, which recognizes and flags cloud, shadow, 
water, and unusual pixels, was applied to the Landsat 3 data even though 
it was believed . that the calibration was not optimum for SCREEN. This 
was done since a visual screening eliminated acquisitions that had many 
such pixels, and since SCREEN still succeeded in flagging some unwanted 
points and false flagging presented no major difficulty. 

Algorithm PFEAT processed the remaining unflagged pixels and ex- 
tracted features used in subsequent analysis. The features consisted 
of a "green arm mean" and a "soil arm mean", each of which contain four 
components. These features, defined below, previously have been found 
to be relatively stable within the "Tasseled Cap" (the pattern of agri- 
cultural pixels plotted in Landsat signal space) , as long as the scene 
contains crops at a variety of growth stages (even if the portion of 
the scene at a given growth stage changes). To calculate these fea- 
tures, one applies first the cosine sun angle correction, and next 
the Tasseled Cap transformation. The features then are computed as 
shown in Table H-1. Once the two feature vectors were extracted by 
PFEAT, they were averaged to form a single scene diagnostic for each 
acquisition. Independently with each Landsat band, a regression was 
carried out of the scene diagnostic for Landsat 3 acquisition with the 
diagnostic for the paired Landsat 2 acquisitions. 
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TABLE H-1. FEATURES COMPUTED BY PFEAT 


Component 


"Soil Arm Mean" 


"Green Arm Mean" 


1 


2 

3 

4 


Mean Brightness for pixels 
in soil window (Greenness 
between +10 and -10) . 


Fifth percentile of Green- 
ness for all pixels. 

Fifth percentile of Yellow 
for all pixels. 

Mean of Nonsuch for pixels 
in soil window. 


Brightness value associated 
with a point at fifth per- 
centile of D (= .68* Bright- 
ness - .73* Greenness) and 
mean of E (=.73" Brightness 
+ .68" Greenness) for pixels 
below fifth percentile of D. 

Greenness value associated 
with the above point. 

Mean of Yellow for pixels 
below fifth percentile of D. 

Mean of Nonsuch for pixels 
below fifth percentile of D. 
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In examining the results of this initial regression, it was found 
that the residuals of ten of the pairs were unusually large. These 
pairs were excluded from the data set in preparing the final transfor- 
mation. But reasons for the excessive residuals were sought. A scat- 
ter plot of the average Tasseled-Cap Yellow component (which has been 
shown [12] to be related to the haze level in the scene) indicated that 
for at least half of the high-residual pairs, the haze level was not the 
same, and therefore those pairs were measures as much of haze difference 
as of calibration difference. Those pairs not explained by haze condi- 
tion are suspected to have some other but still undetermined scene ef- 
fects that caused confounding signal differences between acquisitions 
in a pair. 

The remaining 21 pairs were used to determine a final transformation. 
Two models for the transformation were considered: the first was the 

simple multiplicative and additive transformation in each band that had 
been successfully used for other satellite calibration correction trans- 
formations; the second was pure multiplicative in each band. In examin- 
ing the regression results, it was found that the additive constants vrere , 
relatively small in the first model, and furthermore that there was no 
important difference in the goodness of fit in any band, and so the 
pure multiplicative model was accepted. 

The transformation determined by this effort is given below: 



1.1371 

0 

0 

0 


0 


0 

1.1725 

0 

0 


0 


0 

0 

1.2470 

0 

X + 

0 


0 

• 0 

0 

1.1260 


0 


where x is the untransformed Landsat 3 signal vector 
x' is the transformed signal vector. 
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The following three steps were taken to determine how well this 
transform works. 

First, corrected Landsat 3 acquisitions were processed using the 
SCREEN algorithm. The resulting data was mapped and examined for incor- 
rect screening. .Bodies of water flagged by SCREEN were found to have 
the same general size and shape as the same bodies of water flagged by 
SCREEN in Landsat 2 acquisitions over the same site. Indications of 
incorrect operation of SCREEN were sought, including whether especially 
light or dark fields had an increased tendency to be flagged on Landsat 3 
data. It appeard, at least in the Northern IF.S. Great Plains, that 
SCREEN operated normally on Landsat 3 data transformed by the above 
equation. 

Secondj Landsat 2 and corrected Landsat 3 acquisitions were processed 
using SCREEN, and scatter plots were made of pairs of Tasseled’ Cap com- 
ponents. A subjective visual examination of this data indicated that 
with the correction, the position of features within the Tasseled Cap 
pattern (such as the "soil line") was relatively stable between the two 
satellites . 

And third, an objective examination of the performance of XSTAR on 
calibration corrected Landsat 3 data was carried out. Several consecu- 
tive day Landsat 3 acquisition pairs were identified and processed 
through calibration correction, SCREEN, XSTAR, and Tasseled Cap. As 
Indicated above, the data patterns looked right qualitatively. The aver- 
age difference (RMS error) between data on consecutive days was computed' 
on a pixel-by-pixel basis. Since these same measures were previously 
used in the original XSTAR evaluation [3], the expected range was known. 
The values computed fall within the previously established normal range, 
indicating that XSTAR was working as well as it normally does. 

Given the above three indications, we feel that the transformation 
has proven in its self reasonably well, at least in the region for which 
Landsat 3 data was available in this study. 
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APPENDIX I 

DESCRIPTION OP TOLERANCE BLOCK STRATIFICATION 


This Appendix motivates and describes a study whose purpose was 
to find improved statistical methods of spectral stratification in the 
context of Procedure M, a system for estimating the acreage of an ag- 
ricultural crop, such as wheat, from digitized Landsat data [ 3 ] . 

The development of this procedure was stimulated and supported by 
the Large Area Crop Inventory Experiment (LACIE) . Results are pre- 
sented in Section 5.2. 

Procedure M as applied to wheat recognition stratifies spectral 
data by: 

1. Clustering the pixels into field-like groups called "quasi- 
fields" that are homogeneous spectrally and spatially. 

2. Clustering the quasi-fields spectrally into strata. 

Step 2, the clustering of quasi-fields into strata, is designed 
to separate wheat from non-wheat strata and thereby achieve a sampling 
efficiency. 

The grouping of pixels into quasi-fields has been largely suc- 
cessful. Figure I-l is a histogram of the percent wheat in quasi- 
field interiors. (The interiors consist of pixels faced on all four 
sides by pixels from the same quasi-field) . This histogram was com- 
piled over all quasi-fields that have interiors from 12 Kansas seg- 
ments, three acquisitions each. Most of the quasi-fields have less 
than 10% or more than 90% wheat. Between 10% and 90% wheat, there is 
only a small scattering of quasi-fields. 
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FIGURE I-l. HISTOGRAM OF PERCENT WHEAT IN QUASI-FIELD 
INTERIORS FOR 12 KANSAS SEGMENTS 
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The picture would not be as pretty if we included edge pixels 
(i.e., those that are not interior) in the quasi-fields but we would 
not expect it to be. Edge pixels are often crossed by field bounda- 
ries and are the ones that suffer most from misregistration. 

The corresponding histogram for strata (Figure 1-2) shows some 
mixing of wheat and non— wheat quasi-fields . To make this histogram 
comparable to the other, the stratum count is weighted by the number 
of quasi-fields in each stratum.' Also for comparability, the histogram 
is based on quasi-field interior ground truth. So whatever fuzziness 
is in this histogram is not caused by edge pixels. 

A big group of non-wheat quasi-fields are put together into rela- 
tively pure strata. The group is not as big as in the quasi-field 
histogram, for when we compare the two figures, we see that some of 
the 0 to 10 percent quasi— fields in the quasi— field histogram have 
spilled over into the 10 to 20 and 20 to 30 percent bins in the stratum 
histogram. Similarly, the stack of wheat quasi-fields is spread out 
into the 80 to 90 and the 70 to 80 bins. 

The stratification was carried out by our unsupervised clustering 
algorithm BCLUST [ 9 ]. The question we are considering is whether 
stratification can be improved by a better clustering algorithm. 

One problem with BCLUST is its tendency to produce a few large 
clusters and many small ones. Figure 1-3 shows a typical distribution 
of pixels in a 40-cluster stratification. We try to sample in propor- 
tion to the size of the strata because this is the best rule when the 
stratum wheat proportions are unknown. But in the BCLUST stratification 
the big strata are multiple sampled and many small strata are unsampled. 
Leaving the small strata out would create a bias, so we combine the zero 
allocation strata into one wastebasket stratum and sample from it pro- 
portional to size. (But we require at least one quasi-field in the 
sample.) We cannot expect that this wastebasket stratum will be pure, 
so the sampling from it is inefficient. 
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FIGURE 1-2. HISTOGRAM OF PERCENT WHEAT IN SPECTRAL 
STRATA FOR 12 KANSAS SEGMENTS 



FIGURE 1-3. PIXEL DISTRIBUTION FOR BCLUST STRATA 
■ . SEGMENT 1165 
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The large strata do not have sampling problems if they truly 
separate wheat from non-wheat. But if they are so large that they 
mix up the wheat and non-wheat quasi-fields then it would be better 
to divide them further into small strata, more localized spectrally 
and more homogeneous with respect to crop type. 

A good clustering algorithm that produced more uniformly— sized 
strata might improve on the stratification performance of BCLUST. 

An approach to defining a clustering algorithm producing equal- 
sized clusters is the use of tolerance blocks. "Tolerance blocks" 
are equally-populated regions of spectral space constructed as follows . 
We decide on a small number of channels, t^,...tj^, to generate the 
blocks. We consider the first channel t^ and order all the quasi- 
fields according to this channel. We separate this' ordered group of 
quasi— fields into n^ equal-sized subgroups — equal in the sense of 
having approximately the same number of pixels (Figure 1-4). Then 
we consider each subgroup in turn, order it according to our next 
channel t^, and divide it into smaller subgroups (Figure 1-5). 

We can now consider each one of the smaller subgroups, order it 
according to our third channel t^, and divide it into n^ still' smaller 
subgroups. We keep this up for all the generating channels specified. 
The final subgroups are the tolerance blocks, n^, n 2 ,...n^ in all. 

Not all channels need be included in this process. If the same 
set of channels is used in a different order, the tolerance blocks pro- 
duced are not necessarily the same. (The results, however, were very 
similar in our tests.) Wlien channel t 2 is used to divide the first set 
of subgroups, the points of division will, in general, be different from 
subgroup to subgroup (column to column in Figure 1-5). Because we don’t 
cut any quasi-fields in half, but rather assign them to one subgroup or 
another, the equality of the pixel size of the subgroups can only be 
approximate. 
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(The cuts in Channel separate the quasi-fields 
into five regions of nearly equal pixel size.) 


FIGURE 1-4. FIRST CUT TO CREATE TOLERANCE BLOCKS 



(The columns are equal-sized groups of quasi-fields sepa- 
rated by cuts in Channel t^^. The rectangles are equal- 
sized groups of quasi-fields separated by cuts in Channel t2-) 

FIGURE 1-5. FIRST AND SECOND CUTS TO CREATE TOLERANCE BLOCKS 
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The blocks are spectrally homogeneous with respect to the generating 
channels. How homogeneous they are depends on the number of divisions 
in each channel. But because the number of blocks is the product of 
the number of divisions, the number of divisions in each channel must 
be small if we are to end up with a reasonably small number of blocks. 

So spectral homogeneity of tolerance blocks is limited in two ways. 

Some channels are left out of the; block construction and those that 
are represented may have coarse divisions. 

In order to achieve a greater spectral homogeneity, we defined a 
second tolerance block algorithm that uses all the spectral channels 
in the clustering process. The tolerance block means are used as seeds 
distributed like a network throughout spectral space. Around the seeds, 
clusters are formed by ordinary spectral clustering using a distance 
function. .Although a subset of channels may have been used to create 
the blocks ,, all channels are used to compute the block means and carry 
out the clustering. We hoped to combine in one algorithm the virtues 
of uniformly-sized clusters and spectral homogeneity. 

How -well the tolerance block algorithms have succeeded in equalizing 
the clusters can be seen in Figure 1-6, a comparison of distributions of 
strata sizes produced by the three algorithms, BCLUST has a very uneven 
distribution as we have seen. Many clusters have only a very small num- 
ber of pixels. When the tolerance blocks themselves are used as clus- 
ters, the distribution is very even. When the tolerance blocks are 
used as- seeds, the distribution is less even than for the blocks but 
considerably more even than for BCLUST. 
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FIGURE 1-6 .■ PIXEL DISTRIBUTIONS FOR THREE CLUSTERING ALGORITHMS 

’ SEGMENT 1165 



APPENDIX J 
ERROR MODEL STUDIES 


This appendix contains investigations of the effect of labeling error 
on the error in the proportion estimate. Section J.l is an analysis of 
the effect of labeling error on the bias of the estimate in a simplified 
situation. Section J.2 presents a. model for predicting bias and variance 
of -Procedure M stratified sampling estimates when there are labeling errors. 
Classification errors are assumed constant in this model. In Section J.3, 
the results are extended to the case of distributed labeling errors. In 
both J.2 and J.3 the simplifying assumption is made that blobs are sampled 
and estimates made without regard to the number of pixels in the blob. 

This assumption permits a manageable expression' for variance. Finally, 
Section J.4 relates labeling error to achieving some level of performance, 
as is exemplified by the 90/90 criterion. 

J.l LABELING ERROR AND ESTIMATION BIAS: SIMPLE MODEL 

A simple model is described and a corresponding estimation problem 
analyzed which dramatizes the effect of labeling error on proportion esti- 
mation. 

We assume a' random sample from an infinite population that contains 
two classes of objects, wheat and other. The proportion p of wheat is 
.'Unknown. We don't observe the class to which the points in the sample 
belong, but the points are labeled (by analyst-interpreters or any other 
method) and we observe the labels only. It is assumed that the labeling 
process introduces error as follows. If a sample point is in class W 
(wheat), a is the probability that it is correctly labeled. If a point 
is in class 0 (other), 3 is the probability it is incorrectly ' labeled. 

The only knowledge we have about a is that it lies in some subinterval 
of the unit interval, i a g' similarly 3 lies in some interval, 

3^ £ 3 ^ B 2 ' The problem is to estimate p. 
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In formal terms, let 



be N independent observations of a random variable X with 
P^(X=1) = pa + (l-p)e 
P^(X=0) = 1 - P^(X=1). 


The parameters a, g, and p lie in the intervals, a^ £ a ^ a 2 » 3^^ ^ ^ S 2 > 

0 g p < 1. The problem is to estimate p. 

Let 

p = pa + (l-p)3 
N 

Then 'l X. has a binomial distribution with parameters (N,p). If we knew p 
1 

exactly we could not determine p exactly. All we could say was that p 
was in a certain range, this range depending upon the ranges of a and g. 
Thus we have an identif iability problem and p is not estimable in that 
an unbiased estimate of p does not exist. 

Figure J-1 illustrates the range of p as a function of p for 
0.65 g a £ 0.90, 0.05 < g g 0.25. The range of p is 0.05 < p < 0.90. 

Now let p be an estimate of p and let E(p|p,a,g) denote the expected 
value of p given p, a, and g. Then for fixed p^ and all triplets p,a,g 
for which p^ = (a-g)p+g 

E(p|p,a,g) 

is constant. Thus for this set 

E(pjp,a,g) - p 

p^ = (a-g)p + g 


max 

a,g,p 
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is at least half the range of p in this set. For = 0.50, the range 
of p (indicated by the vertical segment in Figure J-1) is from approxi- 
mately 0.38 to 0.75. This means that for any sample size and any esti— 
mator the absolute value of the bias will be at least 0.18 for some p 
on the range 0.38 to 0.75. Nor will any sampling scheme improve the 
situation; only additional prior information about a, 8, and p will help. 


J.2 A MODEL FOR PREDICTING THE BIAS AND VARIANCE OF PROCEDURE M 
STRATIFIED SAMPLE ESTIMATES WHEN THERE ARE LABELING ERRORS 


Procedure M clusters blobs into spectral strata in order to gain 
the efficiency of stratified sampling in forming crop estimates. We 
have measured the success of this clustering by comparing two variances: 
the variance of the stratified sample estimate and that of the unstrati- 
fied sample estimate. The biases are not considered in this evaluation 
because they can be shown to be zero. The ratio of the two variances is 
a convenient way to express the comparison; it provides a single number 
to represent the performance of a particular clustering algorithm or 
parameter setting. 


was 


Our first expression for this "reduction of variance factor" 

E n . p . (1-p . ) 
np(l-p) 


[42] 

(J-1) 


where' 

n^ is the number of pixels in stratum i, 
n is the total number of En^ of pixels in the segment, 
p. is the true proportion of wheat in stratum,' 
p is the true proportion En^p^/n of wheat in the segment. 


This expression is based on the assumptions that sampling from the strata 
is binomial with replacement and that sample blobs are allocated to 
strata in proportion to the number of pixels in each stratum. 
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In the tolerance block study [40 ] , the second assumption was aban- 
doned as unrealistic, and an ”m-sample reduction o£ variance factor" 
was used. This ratio of variance is computed by allocating a sample 
of size m as nearly as possible proportional to the size of the strata, 
but subject to the necessity that the allocation be an integer. The 
ratio turns out to. be 


2 /^i\^ P.-d-P.) 

all strata i I — I 


a . 

1 


‘ pC1-p) 


(J-2) 


m 


where is the number of sample blobs allocated to stratum i. 


m is the size Sa^ of the sample. 


This measure, too, suffers from the unrealistic assumption that 
the blobs are sampled with replacement. The actual sampling technique 
is unbiased [3 ] but with a- variance that is exceedingly difficult to 
calculate. A model is necessary and it can be made more realistic by 
sampling without rep’lacement — r a model we should have used in the 
tolerance block study. The simplifying assumption of the model is that 
we are sampling blobs and making estimates without regard to the number 
of pixels in the blob. 

The number of wheat blobs in the sample from stratum i then has the 
hyper geometric distribution with variance 


a.p^(l-Pi) 



(J-3) 


where b. is the number of blobs in stratum i. The reduction of variance 

X 

factor becomes 


all strata 


VVM 


pd- p ). 

m 


N 


(J-4) 


where b is the total number Sb^ of blobs in the segment. 
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When we extend the model to include labeling error, we introduce 
the possibility of bias. We therefore have two performance measures to 
consider, bias and variance, and a ratio of variances is no longer 
especially convenient. 

A simple labeling error model, widely used at JSC and ERIM, is of 
the following pattern: 

The Probability 
of Choosing 

Wheat 
Other ■ 

The probability that a blob 


When We Truly Have 
Wheat Other 


a 

1-a 


3 

1-6 


(J-5) 

from stratum i will be labeled wheat is 


= ap^ + g(l-p^) = (a-B)p^ + 3 

This fact allows us to obtain the bias of the wheat estimate whether 
the model or the Procedure M sampling technique is used. For complete- 
ness, we present the derivation here. 

The wheat estimate for stratum i is 


w. 

1 


TA ,x. 
J J 
TA. 

3 


where x. is the labeled percent x<rheat of blob j in the sample, 

A. is the number of pixels in blob j if we use the Procedure M 
^ sampling scheme, 

A is 1 if we use the model. 

3 


The expected value of w^ is 


SA . Ex. 

_J 1 

Z\. 

3 


= Ex. 
X 


^i 


(J-6) 
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The wheat esthnate for the segment is 


Sn.w. 

X X 


It's expectation is 


2n . 

X 


Hn.q. Zn^[(a-g)p^ + 3] 
2n . 2n . 

X X 


Sn.p-. 


= (a-3)p .+ 3 


The bias is this expression minus p 


= (a-3)p + 3-p = (ot-3-l)p +-3 

If we suppose a model where a and 3 are different for different 
strata, the bias is 


(J-7) 


(J-8) 


En. 

X 


The variance of the wheat estimate is obtained from the variances 

of the stratum estimates. We approximate each stratum estimate by using 

the hypergeometric model (sampling without replacement) . We let W be 

the number of true wheat blobs in the stratum sample and L, the number 

of blobs labeled wheat. We are assuming that W is hypergeometric and 

therefore has mean a.p. and variance 

x*^x 


^iPi(l-Pi) ^b^j 
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We obtain the variance of L from the general theorem 

Var L = EVar(L[w) + Var E(l[w) (j_ 9 ) 

To begin with, we will evaluate the first term EVar(L|w). When W is given, 
L is the sum of two independent random variables: the number of wheat 

blobs labeled wheat and the number of non-wheat blobs labeled wheat . 

The first is binomial (W,a), and the second binomial (a^-W,3). The 
variance of the sum L is the sum of the two variances 

a(l-a)W + e(l-3)(a^-W) 

The expectation of this term with respect to W is 

tx(l~a)a^p^ + B(l-e) (a^-a^p^) 

'= a^[a(l-a)p^ + 3 (i-6) (1-p^) ] 

After algebra, this expression can also be written 

a^[3-3^ + (a-3) (l-a-3)p^l 

The second term Var E(l|w), obtained as before by considering L as the sum 
of two binomials. 


= Var[aW + 3(a^-W)] 

= Var[(a-3)W + 3a^] 

The constant term does not affect the variance and can be omitted. Hence, 
Var E(l[w) = Var(a-3)W = (a-3)^Var W 


~ a^Ca-e)^ 


Pj_Cl-Pj_) 



The variance. of L is the sum of the two terms. What we really want is the 
variance of L/a^, the wheat estimate for stratum i based on the labeled blobs. 
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This is 


Var L 




9 9 /b,-aj 

3-P^ + (a-g) (l-a-e)p . + (a-g) p.(l-p.) 

1 X X ^ b^-ij 




(a-a^)p^ + (g-g^) (1-p^) + (a-g)^p^ 


(J-10) 




X . 


The variance of the segment estimate is the sum of these stratum variances 
2 

weighted by (n^/n) . The variance of the unstratified segment estimate 
is a single term analogous to (J-10) with p, b and m replacing p^-, b^, 

-and . If « and g vary' from stratum to stratum, then and g^ replace 
a and g in (J-10) . 

Incidentally, theorem (J-9) may be used to verify that the variance 
of the with-replacement estimate is 

[(a-g)p^ - 2g(a-g)p^ + (a-g)^ p^^ + g-g^]/a^ (J-11) 


The result 'can be checked directly because the number L of blobs- labeled 
wheat now has a binomial (a.,q.) distribution with variance a,q,(l-q.) 

X ^x x^x ^x 

which agrees with (J-11) when q^ is expanded. 

J.3 BIAS AND VARIANCE WHEN LABELING ERRORS ARE DISTRIBUTED 

In Section J.2, above, a model was developed for the bias and 
variance of the Procedure M estimate under labeling error. The result 
was that the expected value of the wheat estimate is 

(a-g)p + g, . (J-12) 

the bias is 

(a-g-l)p + g, (J-13) 
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and the variance is 

(a-a^)p + (3-g^) (1-p) + (a-e)^p(l-p) 

a 


(J-14) 


where 

a = probability of labeling a blob wheat given that it is wheat 

3 = probability of labeling a blob wheat given that it is not wheat 

p = proportion of wheat in a stratum or segment 
b = number of blobs in the stratum or segment 

a = number of blobs in the sample from stratum or segment 


The model was based on the assumption of a fixed a and 3. Here we 
apply the model when a and 3 have a knom distribution. 

Let Ea, E3, var a and var 3 denote the expected value of a and 

of 3 and the variance of a and of 3j respectively. Let cov(a,3) be the 

2 

covariance of a and 3. Ea can either be thought of as the expectation 

2 2 2 ■ 
of a or as var a + (Ea) , and similarly for E3 . Ea3 is the expectation 

of a3 or equivalently, cov(a,3) + (Ea)(E3). 

To get the bias of the wheat estimate p we use the conditional 
expectation formula 

bias = E(p-p) = Ea3E[(p-p)la,3l (J-15) 

Now E[(p-p)|a,3] is given by (J-13). Hence 

E(p-p) = E[(a-3-l)p + 3] 

= (Ea-E3-l)p + E3 (J-16) 

To get the variance of p we use the conditional variance formula 

var p = var a3 E (p I a , 3) + Ea3 var (p|a3) (J~17) 
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We will evaluate (J-17) one term at a time. The first term Is, using 
(J-12) , ■ 

var ag [ (a-g)p + B] 

= var [pa + (l-p)S] 

= p^ var a + (1-p)^ var g- + 2p(l-p) cov (d,g) (J-18) 

The second term Is, using (J-14) , 

(a-ci^)p + (g-g^)(l-p) + (a-g)^p(l-p) 

Eag 

a 

(Ea-Ea^)p + (Eg-Eg^) (1-p) + (Ea^-2Eag+Eg^)p (1-p) ( ^ ) 

= (J-19) 

a 

The variance of p is the sum of (J-18) and (J-19) . 

Independence of a and g were not assumed because there is generally 
a relationship between Type I and Type II errors in detection problems. 

If the system is adjusted to make a as large as possible, then g is also 
increased, and if we try to reduce g, we may end up reducing a. 

The detection component in the wheat estimate is a human being who 
may be sensitive to pressures to estimate a reasonable proportion of 
wheat for the segment. It may be, therefore, that if his a is low, he 
may tilt his judgment to increase g to make the overall- proportion accept- 
able. Thus, an inverse, rather than a direct relationship between a and g 
could apply in this case. 

The distribution of a and g will be difficult to estimate with con- 
fidence. If we guess the distribution, then the expected bias becomes 
a simple sum (J-16) of our guesses. ' The distribution depends on the preju- 
dices and pressures affecting the AI's which are difficult to quantify. 

It would vary considerably from AI to AI and from situation to situation. 

Our experiment on three AI's would be a very small sample from which to 
estimate this distribution. 
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J.4 LABELING ERROR IMPACT ON THE 90/90 CRITERION 

It was shown in Sections J.2 and J.3 that the bias of the Procedure M 
target crop proportion estimate in the presence of labeling error is given 
by the expression 

b == ap + g(l-p) - p (J-20) 

where b is the bias, 

p is the true proportion of the target crop, 
a is the accuracy of labeling that target, 
and B is the inaccuracy in labeling other (errors of commission) . 

Equation (J-20) can be written 

b = (a-B-l)p + B (J-21) 

expressing the bias b as a linear function of p in the parameters a and 
B, with slope a-B“l and y intercept B* 

Figure J-2 illustrates the parametric equations over the dynamic 
range of a, B, and p: 

0.0 :< a <. 1.0 

0.0 ^ B 1.0 

0. 0 ^ p ^ 1.0 

Note that an unbiased estimate of p can be achieved by compensating 
errors of omission and commission relative to the proportion p. In the 
extreme, errors of total commission (B=1.0) result in an unbiased esti- 
mate only if p=1.0, that is, no other exists in the population; and 
errors of total omission (a=0.0) result in unbiased estimates only if , 
p=0.0, that is, no target crop exists in the population. Similarly, a 
commission error of .5 and omission error -of .5 is unbiased only if the 
target crop is 50% of the population. 

It was shown in Section J.l that without knowledge of the range 
of a and B and their distribution -over that range, that an unbiased 
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estimate of p does not exist. Here we will examine what limited know- 
ledge of these parameters gained by experience implies with respect to 
the 90/90 criterion. 

Evaluations of analyst labeling error in LACIE suggest that the 
following are reasonable ranges of a and 3 in labeling wheat: 

.6 < a < ,8 

.0 < e < .1 


Though little is known about the distribution of error over this range, 
for purpose of illustration we" will assume the error can be described by 
a normal distribution or other definable function. 

Figure J-3 illustrates the range of bias for a uniformly distributed 
and bounded error. The maximum positive bias in this range, is .1 when 
a=.8 and 3=.l, occurring when p=.0. The maximum negative bias occurs 
when p=1.0, a=.6, and 3=*0. If p exceeds .33, an unbiased estimate is 
not achieved in this range of ct and 3. In addition, the target crop .is 
always underestimated. Note that if j a system is biased, one way to im- 
prove overall accuracy is by introducing compensating error in labeling. 

Further, restricting p to the typical range 

• 2 ^ p ^ . 5 


we find the bias limited to the range illustrated in Figure J-4. Note 
that an overestimate will occur only if p < .33 and errors of commission 
approach maximum, while errors of omission approach minimum. Associated 
with the points (A,B,C,D) are the relative error rates (.2, -.4, -.1, -.4). 
If the stated range of performance were the absolute range, the 90/90 
criteria would never be achieved for p=.5. 

Over the stated range, 90% accuracy is achieved wherever 

- ^ .1 (J-22) 

■P 
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FIGURE J-3.’ RANGE OF BIAS FOR 
. 6 <. a <. . 8 
0 ^ 8 :< .1 





(J-23) 


2 ™ 

Using Equation (J-21) , 90% is achieved where; 

(a-3“.9)p + 0 > 0 and (a— 3-l.l)p + 3 < 0 

Prom (J-22), this is necessarily contained in the range 

-.Ip i b ^ -Ip . . (J-24) 

Figure J-5 illustrates Equation J-24 (the dashed lines) and Equa- 
tion J-23 (the shaded area) . If one associates a distribution of labeling 
error the efficiency of the system can be computed. For example, if the 
specified range totally bounded a uniform distribution of error, then 
the efficiency of achieving 90% accuracy over the range .2 :< p :< .5 is 
the ratio of the area shaded in Figure J-5 to the area shaded in Figure 
J-4. At a given p, the efficiency is the ratio of the respective ranges 
of b at that p in the two figures. For example, the efficiency of 
achieving 90% accuracy at p=.5 is 0%. On the other hand, if the dis- 
tribution of labeling error were normally distributed, the efficiency 
of this system at a 90% accuracy level is the area under the normal curve 
at a p within the range specified by Equation J-24. This is illustrated 
in Figure J-6. 
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APPENDIX K 


INFOKMATION THEORETIC MEASURES OF AGRICULTURAL 
INVENTORY SYSTEM PERFORMANCE 


Multispectral remote sensing systems are being used for agricultural 
inventory applications. Their overall performance usually is measured 
by their accuracy in estimating crop acreages (or crop proportions in 
sample areas) or by their marginal probability of correct classification 
of known scene elements. Intermediate stages of some area estimation 
procedures involve stratification and sampling of subareas (notably, 
LACIE's Procedure 1 and ERIM's Procedure M) . 

The variance reduction factor is currently used to measure the per- 
formance of such stratification procedures. This appendix Introduces and 
develops alternative performance measures or figures of merit that should 
be of value, particularly when several classes or crops are involved.* 
These new measures are based on information theory concepts. Application 
of information theory concepts to other and more general aspects of area 
estimation problems may also prove productive and is briefly discussed. 

K.l INFORMATION THEORY CONCEPTS 

Consider the simple communication channel portrayed in Figure K-1. 
The input has m possible states and the output n possible states; in 
general, m ^ n. A transfer of information occurs when the output state 
is Influenced by the input state. The transfer is perfect when the out- 
put always has a 1:1 correspondence with the input or, in other words, 
when there is no uncertainty about the input given a particular value 
of the output. Noise in the communication channel can introduce errors 
or uncertainty about the input state for any given output state. 

These concepts were first reported at the June 1979 Quarterly Review 
under this contract, as documented in ERIM 132400-24-P, Analysis of 
Scanner Data for Crop Inventories, Environmental Research Institute 
of Michigan, Ann Arbor, MI, August 1979. 
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■ Input, 

Communication 

i = 1, . . . ,m 

Channel 

nx = number of 


possible 

I 

input states 

Noise 


Output, 

J = 

n = number of 
possible 
output states 


FIGURE K-1. DIAGRAM OF A COMMUNICATION CHANNEL 
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K . 1 . 1 SELF -INFORMATION 

To quantify information transfer processes, information theorists, 
beginning with Claude Shannon, have defined an information measure. The 
self information or information associated with the known occurrence of 
a state x^, which occurs with probability P(x^), is defined to be: 

Kxp . log^ (p(Sp} ■ 

where the base "a" usually equals two, in which case the units of infer- 
matlon are in "bits". This quantity also may be interpreted as the 
uncertainty in x^ or the (a priori) information needed to makex^ cer- 
tain. The more rare the state, the greater is the information associated 
with its occurrence. A useful characteristic of this measure is that 
it is additive, e.g., the total information represented by two inde- 
pendent events is the sura of their self-information values. 

K.1.2 ENTROPY 

Next, let us consider the average amount of information associated 
with repeated observations of the event. This quantity is called the 
entropy or average uncertainty of x; it is the average (a priori) infor- 
mation needed to make x certain. The entropy over the ensemble X = {x.} 
is 


m • m / 1 

H(X) = - I P(x ) log P(x ) = I P(X ) log(:rY^ 
i=l ^ "" i=l ^ 


(K-2) 


If the probabilities' P (x^) are equal, then P(x^) = ~ and H is a maximum: 


H = log m 
max 


(K-3) 


If the base a = 10, the units of information are in "hartleys" and, 
if a = e, the units are in "nats". 
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Te rih 

The ininimum value for H is zero and results when a single state occurs 
with probability one, i.e., there- is no uncertainty about the input state. 

Since the value o£ entropy depends on the probability distribution 
of the input states, a relative measure is sometimes defined. The rela- 
tive entropy is normalized by the maximum entropy possible for the given 
number of input states, i.e. , 



max 


(K-4) 


K.1.3 JOINT INFORMATION 

The information content of the joint occurrence of two events depends 
on their joint probability of occurrence, P(x^,y^): 

= “log P(x^,y^) 

But , since 


Similarly, 


= P(y^) P(x^|yJ 


(K-5a) 

(K-5b) 


we have 


Thus , 


l(x.-.y.) = -log[P(x ) P(y |x )] 

■*“ J J 

I(x^,yj) = -log P(x^) -log P(yjjx^) 


I(x. ,y.) = I(x.) + lCy.|x.). 
X j X ' X 


(K-6a) 


i(Xi,yj) = 


(K-6b) 
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K . 1 . 4 INFOEMA.TION EXCHANGE . 

If and are independent, then 

P(y^jx^) = P(y^) 

and 

Ky^ix.) = I^) 

giving 

r(x.,y^) = I(x^) + i(yj) 


This is the maximum value of r(x^,y^). However, from the standpoint of 
a communication channel it represents no information transfer from input 
to output . 


.Maximal information transfer occurs when the input x^ and output y^ 
are perfectly correlated, i.e., when 

P(yj |x^) =1 
- 1 

and 

P(Xj.,y^) - P(x^) = P(y^) 


giving 

i(Xi,yj) = i(x^) = i(yj) 


Let us then define 1^^, the next exchange or transfer of information 
through the channel, to be: 


or 


= Max{l(x^,y^)} - I(x^,y^) 

= I'Cx^) + Kyj) - Kx^.y^) (K-.7) 
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Other expressions can he obtained by substituting Equations K-6b and K-6a 
into Equation K-7: 


I (x,,y,) = I(x.) - I(x.[y.) 


= I(y^) - Ky^lx^) 


(K-8a) 

(K-8b) 


From Equation K-8a we see that the joint occurrence provides all the 

information associated with x. , less that lost due to confusion or the 

x’ 

a posteriori uncertainty about which x was sent given that a particular y 
■was received. Again, if x^ and y. are perfectly correlated, then 
I^(x^,y.) = I(x^) and the net information exchanged then is the entire 
input information. At the other extreme, if y. is independent of x., 

1 X 

then ” O 5 the output bears no relationship to the input, and 

the net exchange is zero. 

By utilizing Equation K-8a and the definition of self information, 
we obtain the following: 




(K-9) 


and 


I (x . ,y . ) = log f ^ 


n- x"3 


P(x^) 


(K-10) 


k.1.5 mutual information 

The definition for entropy H(X) in Equation K-2 is the expected value 
of the self information I(x^). Similarly, we can define the mutual infor- 
mation to be the expected net information or average information exchanged 

I(X;Y) = E{l^(x^,y^)} (K-11) 
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Thus , 

m n (x . ( j- , ) \ 

I(X;Y) = I I P(x^,y ) log ■ j . (K-12) 

1=1 j=l \ ^ i-' / 

■ Other forms are, patterned after Equation K-10, 

I(X;Y) = H(X) + H(Y) - H(X,Y) (K-13) 

and, patterned after Equation K-8, 

I(X;Y) = H(X) - H(x|y) 

((X;Y) = H(Y) - H(y|x) (K-14) 

The interrelationships between the quantities in Equations K-13 and K-14 
are diagrammed in Figure K-2. If we consider x^ to be the input and 
the output of the _ communications channel in Figure K-1, then: 

I(X;Y) is the average information exchanged or the mutual 
information, 

H(X) is the entropy or average self information of the 
input, also referred to as the a priori entropy, 

H(x|y) is the equivocation or average loss of information 
about the input given the output of the noisy 
communications channel, also referred to as the 
a posteriori entropy or average uncertainty about 
the input given the output, 

h(y|x) is the average error in the output information 
given the input, and 

H(X,Y) is the total entropy ,or average information • 
associated with the channel. 

The maximum possible value of I(X;Y) is called the channel capacity, C. 
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The relationships discussed herein assume that selection of any 
given input state ^ not influenced by any previous .state selection, 
i.e., that the source does not possess memory. If this is not true, 
more involved relationships apply. 

K. 2 LINKING INFORMATION THEORY CONCEPTS TO AREA ESTIMATION SYSTEMS 
' EMPLOYING REMOTELY SENSED DATA 

Let us begin by adapting the communications channel concept of 
Figure K-1 to elements of systems that employ and extract information 
from remotely sensed data for use in area estimation. To do so, inter- 
pret the' communications channel itself as consisting of the information 
extraction and processing operations which are conducted to derive or 
estimate various characteristics of the observed scenes. For any desired 
output quantity, there' are appropriate input characteristics and process- 
ing operations, as shown in Figure K-3. The desired output may vary, 
from a single crop proportion estimate over a worldwide production area 
or subunit, on one hand, to a detailed pixel-by-pixel or hectare-by-. 
hectare map of crop production in selected agricultural areas on the 
other hand. 

The underlying idea proposed here is to use information theory 
approaches to measure system performance by finding the appropriate 
pair of input-output- quantities and considering the Intermediate proc- 
essing to be a communications channel. The less closely the derived 
output matches the input characteristic being estimated, the "noisier" 
is the processing channel. Information theory provides quantitative 
measures of this degradation or failure of the estimation system to 
duplicate the true input information. 

Sections K.3 and K. 4 discuss the problem of measuring the perform- 
ance of a stratification procedure, whether it be the spatial-spectral 
stratification performed by the BLOB algorithm, the spectral stratifica- 
tion performed by BCLUST, or the classifier stratification and adjustment 
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INPUT 


CHANNEL 


OUTPUT 



(a) True Class Proportions 
of Inventory Area 

(b) True Class Proportions 
of Sample Areas 

- Two-class 

- Multi-class 

(c) True Class Strata 

(d) True Class Composition 
of Derived -Strata 

- Two-class 

- Multi-class 

(e) True Spatial Definition 
and Labels of Fields 

- Two-class 

- Multi-class 

(f) True Proportional Labels 
of Derived Quasi-Fields 

- Two-class 

- Multi-class 

- Full vs. Interior Only 

(g) True Labels of Pixels . 


Stratifier ' 

Classifier 

Proportion 
Estimator , 


(a) Estimated Class Propor- 
tions of Inventory Area 

(b) Estimated Class Propor- 
tions of Sample Areas 

- Two-class 

- Multi-class 

(c) Derived Scene Stratifi- 
cation 

(d) Estimated Composition 
of Strata 

- Two-class 

- Multi-class 

(e) Derived Quasi-Field 
Spatial Definitions 

(f) Labels Assigned to 
the Quasi-Fields 

- Single-class 

- Proportional 

- Full vs. Interior Only 

(g) Labels Assigned to Pixels 


FIGURE K-3. COMMUNICATIONS-CHANNEL VIEW OF INFORMATION EXTRACTION 
AND AREA ESTIMATION SYSTEMS USING REMOTELY SENSED DATA 







performed in Procedure 1. In order to compute Information theoretic 
quantities, it is necessary to substitute stratum proportions for state 
probabilities in the formulas. 

As a digression to indicate relative magnitudes of information 
quantities, let us consider a 5x6-mile LACIE segment which contains 
22,932 Landsat pixels. In principle, each pixel could have a unique 
spectral vector and represent a unique scene class. In that case, the 
maximum entropy would be; 

H = log, (22,932) = 14.485 bits/pixel 

At the other extreme, if the segment's pixels were either wheat or non- 
wheat, the maximum entropy would be: 

H = log, (2) = 1 bit/pixel 

Between these extremes, we have roughly 1000 quasi-fields per segment, 
giving 

H - log„ (1024) = 10 bits/pixel 
max z 

and 32 to 128 spectral strata, yielding 

H = log., (128) = 7 bits/pixel 
max L 

and 

H = log, (32) = 5 bits/pixel 
nid-x ^ 

K.3 .DERIVATION OF A FIGURE OF MERIT FOR STRATIFICATION 

Consider the problem of evaluating the performance of the BLOB 
algorithm in delineating quasi-fields in the image data. The output 
characteristic here is the resultant grouping or stratification of 
pixels into quasi-fields. The input characteristic is the true labeling 
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of the pixels in "ground’-truth” data associated with each pixel. Ideal- 
ly, one would want pure or single-class quasi-fields to be defined. The 
extent to which the stratification algorithm achieves this purity is 
reflected in the mutual information of the above-described input and 
output. 

As was shown in the preceding section, the mutual information 
I(X;Y) depends on the average self information or entropy H(X) of the 
input. This entropy will vary from scene to scene and segment to segment. 
Mutual information may be the appropriate measure for multi-segment or 
multi-area analyses. For other purposes, such as within segment compari- 
son of processing techniques, however, normalization may be desirable. 
Therefore, the following figure of merit is defined:* 


H(X|Y) 

H(X) 


h(x|y) measures the average, loss of information about X in passing through 
the system. 

If the stratification gives no information about the classes being 
observed, i.e., there is a total information loss, then H(x|y) = H(X) 
and = 0. On the ot'her hand, if X is determinable exactly from the 
stratification Y, there is no transmission loss and no remaining uncer- 
= 0 and = 1. Thus, one interpretation of 
the transmittance factor for input information through the processing 
system. 

In expanded form, the figure of merit is; 

I P(yJ I P(x |y ) log[P(x |y )] 

“x - 1 - 

• 51 P(x.) log[P(x,)] 

i=l 

Figures of merit based on other normalization factors are discussed 
in Section K. 4, 


is as 


tainty so H(x|y) 
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A two-class .version of Equation K-16 is helpful for many of our 
current studies. For convenience, it is expressed here in terms of the 
wheat and nonwheat classes; 





PW. +P^ log p^ 

L- 3 2 ^ jJ 

Py + % % 


where 


n^ = number -of pixels in Stratum j , 

N = total number of pixels being considered, 

p^ = proportion of wheat in -Stratum j , . 

3 

p^ = proportion of nonwheat in Stratum j , 

j 

= overall proportion of wheat, 

and 

P=y = overall“proportion of- nonwheat. 


(K-17) 


Calculation of the figure of merit was incorporated into one 
of the programs used to evaluate the two-class performance of the BLOB 
algorithm in our Fall 1978 test of Procedure M [ 3 ]. Table K-1 lists 
results obtained for 27 segments, along with the corresponding variance 
reduction factors and average blob purity percentages. Values are 
included separately for full big blobs and for their interior pixels . 


Figure K-4 presents a scatter diagram of variance reduction factor 
vs, figure of merit values for blob interiors. Note the high degree of 
correlation between them, except for a few outliers. These outliers are 
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TABLE K-1. RESULTS OBTAINED IN 1978 TEST OF PROCEDURE M 


FULL "BIG" BLOBS BLOB INTERIORS 


n. 

CASt» 

1. 

segment 

P, 

BV.FULI 

3. 

eoh.fijll 

9. 

PIJR.FULL 

b. 

RV.INT 


6. 

FOM.INT 

7. 

PUW.INT 

1 

1 10«,0 

,30800 

.75000 

97.210 

,10100 


,90900 

98,250 

2 

l-JOB.O 

.55500 

,65200 

90,270 

.13600 


.83200 

95,270 

i 

1312.0 

,37000 

.58100 

88,710 

. 12000 


,86500 

96.000 

t| 

1515,0 

,21500 

,75700 

96.160 

,22000 

-1 

.97900 

99,370 

5 

1515.0 

,27600 

,69500 

90.770 

,baODO 

-1 

,96500 

97,620 

6 

1520,0 

.39600 

.60100 

91,090 

.13700 


.86100 

96,110 

7 

1602.0 

.67300 

.69600 

B3.720 

,18800 


,80800 

93,920 

6 

1606.0 

.55200 

.61600 

ea,P'»o 

.10700 


,85700 

96.110 

9 

161^,0 

.55800 

.58600 

81 .230 

.25500 


.71600 

91,070 

10 

1625,0 

.57800 

.60600 

65.820 

.32100 


,76600 

92.990 

n 

1635.0 

.25100 

.69600 

91.660 • 

,76000 

-1 

.91200 

97,190 

12 

1637.0 

.56100 

,60600 

89.270 

,10200 


.88600 

96.610 

13 

1600.0 

• 3SbO 0 

.58800 

87.550 

, 11600 


.86600 

95.980 

1« 

1652.0 


,61000 

80.650 

.35300 


,62000 

85,390 

lb 

166’2.0 

.39100 

.55900 

66,310 

.16100 


, ,65000 

96,610 

16 

1665.0 

,22700 

.73200 

91.920 

,26000 

-1 

.9bB0Q 

96.770 

1 1 

I6b9,0 

.51900 

,56300 

92.590 

,66200 


,77500 

95,720 

1» 

loei .0 

.28500 

,67100 

90,910 

,71000 

-1 

,90900 

97.170 

19 

1669,0 

.27700 

,68500 

93,660 

.82000 

-1 

.90300 

97,600 

*>0 

1000,0 

,57100 

.58600 


. 10900 


.BSbOO 

96,610 

21 

1803.0 

.50100 

,60700 

9R.680 

.38900 


,91800 

98.960 

22 

1805.0 

.52700 

.'ibVno 

90,010 

.33300 


,b8900 

93.950 

25 

1811.0 

.36900 

,75800 

97.930 

. 11500 


,92500 

98,720 

2« 

1899.0 

,P6600 

,69700 

91 .290 

,63000 

-1 

,95800 

98,660 

2b 

1913,0 

.63900 

.60700 

92,150 

.20600 


,76300 

96,700 

2b 

1927,0 

.29600 

.65500 

91 .260 

.99000 

-t 

,87200 

96,660 

27 

1929.0 

• .76200 

.23900 

67.760 

.62000 


.59600 

82.310 


Notes: RV = Reduction of Variance Factor 

FOM = Figure of Merit, 

PUR = Average Purity (Percent) 
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VARIANCE REDUCTION FACTOR 
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FIGURE K-4. COMPARISON OF VARIANCE REDUCTION FACTOR 
AND FIGURE OF MERIT FOR BLOB INTERIORS 
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segments with low proportions of spring small grain, in which case the 
entropy-based figure of merit appears to give more consistent results. 
Comparable correlations with average blob purity also were observed. 


K.4 ADDITIONAL STRATIFICATION PERFORMANCE MEASURES 

In stratification, the true class associated with each output 
stratum is not of importance; only the stratum purity is of concern. 

For example, whenever all members of stratum y^ are of the same input 
class, P(x^|y^) is unity for one of the i's, say i = I, giving 

log P(xj|y^) = log (1) = 0, so H(Xj|y^) = 0 irrespective of the value 

of P(x.,y,). For each other i I, P(x,,y.) = 0 and H(x.]y.) = 0, so 
^ J 1 1 2. j 

H(x|y.) = 1 H(x.|y.) = 0. Thus with pure strata, H(X|Y), the equivoca- 
^ i ^ J 

tion or average expected loss of information, is zero. As a result, 
is a maximum, = 1, because the integrity of the input classes has 
been preserved by the stratification procedure. There is, however, one 
important characteristic that is not measured by M^. 

The unmeasured characteristic is the number of strata defined at 
the output. As noted above, when all strata are pure, H(x[y) is zero 
and = 1; this is irrespective of the number or size of the strata. 

An extreme case would be if each scene element or pixel were assigned 
to a unique stratum’ and each and every pixel were pure, i.e., belonged 
to only one of the input classes; here again would be unity, an 
undesirable value for a performance measure in this situation. 

The information content (self information) of the output is 
measured by the output entropy II (Y) which does depend on the number 
of output strata. This suggests a normalization by H(Y) instead of 
H(X), giving rise to: 


M = ICX;Y) ^ H(Y) - H(y|x) ^ , _ H(Y|x) 
^ H(Y) H(Y) H(Y) 
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Parts (a) and (b) of Figure K-5 illustrate the relationships between 
quantities used to define the figures of merit and 
to the diagrams in Figure K-5, it is obvious that (as well as the 
other figures of merit) has a maximum value of unity and a. minimum 
value of zero. To better understand the maximum of M^, consider the 
following version: 


My. Referring 


_ H(X) - h(x|y) 
“y H(Y) 


(K-19) 


TT 

If all output strata are pure, H(x[y) = 0, resulting in y . 

This quantity is less than unity, except when H(Y) = H(X), i.e., when 
these pure output strata exactly match the input classes in number and 
size. The minimum value, = 0, results when the. output Y is inde- 
pendent of the input X and HCy[x) = H(Y) (See Equation (K— 18)). 

Weighted-sum normalization factors were suggested as possibilities 
(See Figure K-5(d)), arising out of a concern that might impart too 
severe a penalty on a system, as a function of the number of output 
strata produced. Figure K-6 illustrates with solid lines the rela- 
tionships for two input classes of equal size and 2^ output classes of 
equal size. A pronounced decrease in the curve is also present as 
the average purity of output strata decreases, as shown for 95^ and 90% 
purity strata. Since remains constant at the value for two output 
strata, division by a weighted sum of H(X) and H(Y) would decrease the 
rate of falloff from that shown in Figure K-6. However, a suitable 
criterion for' selecting the weighting factor m has not yet been found. 


The final normalization factor considered herein is H(X,Y) : 



I(X;Y) 

H(X,Y) 


H(X) - H(X 

Y) 

H(Y) + H(X; 

Y) 


(K-20a) 


'■k,' 


H(Y) - H(Y 

X) 

H(X) + H(Y 

X) 


(K-20b) 
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[NO DIAGRAM] 

(d) WEIGHTED SUM OF ENTROPIES =» jE^)H(Y) 

FIGURE K-5. ILLUSTMTION OF VARIOUS NORMALIZATION FACTORS 
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FIGURE K-6. COMPARISON OF INFORMATION THEORETIC FIGURES OF MERIT 
FOR STRATIFICATIONS OF TWO EQUALLY LIKELY INPUT CLASSES 
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(K-20c) 


An appealing feature of this figure of merit is the symmetry exhibited by 
Equations K-20a and K-20b. Upon comparing Equation K-20a to Equation K-19, 


one also sees that ^ decreases more rapidly with increasing output 

entropy than does whenever H(xjY) / 0; see dashed lines in Figure K-6. 


Both numerator and denominator are affected by the information loss. When 
H(x|y) = 0 and there is no information loss, ^ == and prior 

comments apply. 


Another Insight into this last figure of merit is facilitated by 
Equation K-20c. The denominator represents the sum of two terms, the 
entropy or self information of the input and the error associated with 
the output stratification, i.e. , the uncertainty beyond that associated 
with X. This is the total entropy of the processing channel and equals 
the input entropy only when the error is zero. 


The relationships in Equation K-20 also have implications for sam- 
pling requirements to identify or determine which input class (es) should 
be associated with each output stratum, e.g., for proportion estimation. 

If the strata were pure and the label determination perfect, only one 
sample per stratum would be required. The amount of information (in bits) 
acquired from each sample would differ, however, according to the size of 
its stratum, but summed together they would represent the output entropy 
H(Y). Yet as can be seen in Equation K-20a, there is an additional term 
paired with H(Y) in the denominator, i.e., H(x|y) which is the equivoca- 
tion or average expected loss of information. H(x|y) reflects the impurity 
of the output strata. One way to recover this lost Information would be 
by additional sampling so that proportional labels could be assigned to 
some strata. H(x|y) would appear to represent the needed information 
content of additional samples, but relationships between entropy and sam- 
pling have not yet been established. 
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To summarize this section, it appears that a figure 'of merit for 
evaluating the performance of stratifiers should include the output 
entropy in the normalization factor so as to penalize a stratifier with 
too many strata. Several candidates were discussed, with which 

normalizes by the total entropy, having several desirable features. Test- 
ing and evaluation should be conducted to make a choice. ■ Also, relation- 
ships between sampling and information measures need study and development. 

K.5 SUMMARY AND DISCUSSION 

The concept of relating information theory concepts to evaluation 
of area estimation systems employing processed remotely sensed data has 
been developed and presented. New figures of merit have been defined 
for comparing the relative' performances of various stratification, 
clustering, and classification operations. 

One figure of merit measures the transmittance of scene informa- 
tion by the processing system to the user. Example use in Blob per- 
formance evaluation was presented, with preliminary indications of more 
consistency than the variance reduction factor — more analysis is re- 
quired to be definitive. 

The approach may be even more appropriate for the evaluation of 
spectral stratification techniques, since spatial entities (and, there- 
fore, the input entropy, H(X)) will remain constant from technique to 
technique. However, use of the ^ figure of merit would be advantageous ' 
since it would penalize a stratifier with a large number of strata. 

Information theory measures may well not be the complete answer, 
but they appear to have potential. Therefore, it is recommended that 
their utility in measuring system and component performance be investi- 
gated and evaluated as supplements to current measures, such as variance 
reduction factor, proportion bias, and probability of correct classifica- 
tion. The figures of merit defined here provides a starting point; addi- 
tional uses or measures may result from analysis of their use, advantages, 
and disadvantages. 
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■APPENDIX L 

DESCRIPTION OF DATA BASE FOR SMALL GRAINS 

The purpose of this section is to extend the data base discussion 
begun in Section 7, and specifically to present the segments selected, 
to describe the processing algorithms used, and to indicate the form 
of the data. Much of the discussion of Section 7 is repeated to make 
this appendix self-contained. 

The data base consists of 67, 5x6-mi segments located through- 
out the United States Great Plains during the 1976 and 1977 winter and 
spring wheat growing seasons- Each segment consists of data from all 
available useable acquisitions (average 8) of Landsat 1 and Landsat 2, 
merged together with wall-to-wall ground truth Inventory data supplied 
by USDA and prepared by JSC/LEC. Important pixel-by-pixel information 
generated during processing, such as cloud/water/shadow identifications 
or blob numbers, were retained with each segment - 

In order to support the variety of tasks depending on this data 
base, segments were selected after initial screening into two cate- 
gories, A and B. The principal differences between the two categories 
were in the segment selection procedures and the state to which the seg- 
ments were processed. For some applications it is appropriate to use 
Category A segments for development and training, and Category B segments 
for test and evaluation. 

The segment selection procedures used were as follows. First the 
171 segments available to us were screened to eliminate those whose 
acquisition history, ground truth quality, and data quality are inade- 
quate. When carried out, 107 segments remained for possible selection. 
Next, Category B segments were selected by randomly picking a quota (1/3) 
of segments within each APU* and each year, subject to the constraint 

Agrophysical Unit, as used within LACIE. 
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that at least one segment is selected from each APU. Since there is 
data from txjo growing seasons, for which different conditions pre- 
vailed, the ' APU 's were considered distinct from one growing season 
to the next when establishing this random draw. By this procedure, 

38 segments were selected as a uniform draw that tended to cover the 
variability of the region in a carefully drawn statistical sample. 

The remaining segments were considered in picking Category A seg- 
ments. Stricter minimum limits on acquisition history and data quality 
were observed, so that selected segments were well suited for develop- 
ment work. However, the selections were more qualitative, taking into 
account desire for more acquisitions versus desire for a nice spread 
to cover variability. Even though selecting Category A segments first 
would have resulted in more and nicer segments , it was necessary to 
select them second in order to maintain the statistical integrity of 
the Category B sample. 

The segments selected as described above are presented in Tables 
L-1 and L-2. Location, strata, and other information is include as well. 

A set of flow diagrams that summarized the processing carried out 
on each segment is given in Figure L-1. Category B segments are processed 
using all steps except SUPERB, STRIP and COMPRS. A one-paragraph des- 
cription of each processing step and related comments is given below. 

CONVRT performs a reformatting of Accuracy Assessment 1-channel 
ground truth data in 2x3 subpixel form into standard pixel form with 
7 channels (summary code + 6 subpixel codes for each pixel) . 

MERGE reads ground truth data and several single-acquisition 
Landsat files and writes a single file containing merged data. 

SCREEN flags data as clouds, water, shadows, wild, etc, if not 
within a predefined envelope for agricultural data. One channel is 
added for each acquisition. 
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TABLE L-1, CATEGORY A SITES IN THE SMALL GRAINS DATA BASE 

Number of 


Site 

State 

County 

APU 

Phase 

Latitude 

Longitude 

Acquisitions 

1005 

CO ' 

Cheynne 

10 

III 

38.49 

102.20 

9 

1035 

KS . 

Ford 

11 

II 

37.44 

99.58 

14 

1041 

KS 

Meade 

8 

ir 

37.3.9 

100.16 

10 

1059 

TX 

Ochiltree 

4 

III 

36.15 

100.52 

7? 

1099 

CO 

Baca 

. 9 

III 

37-. 25 

102.18 

10 

1158 

KS 

Washington 

12 

III 

39.47 

97.06 

7 

1175 

KS 

Sedgwick 

7. 

III 

37.46 

97.32 

6 

1178 

KS 

Bourbon 

7 

II 

37.43 - 

94.59 ■ 

7 

1506 

CO 

Sedgwich 

10 

III 

40.51 

102.31 

11 

1512 

MN 

Clag 

15 

III 

47.01 

96.22 

5 

1523 

MN 

Cokkin 

20 

III 

■ 46.31 

96.25 

7 

1566 

NB 

Kimball 

15 

III 

41.02 

103.43 

10 

1586 

.NB 

Perkins 

103 

III 

40.46 

101.20 

13 

1606 

ND 

Ward 

19 

III 

• 48.16 

101.22 

4 

1619 

ND 

Grand Forks 

20 

III 

48.04 

97.30 

6 

1637 

ND 

Stutsman 

21 

II 

47.15 

99.19 

6 

1640 

ND 

Barnes 

19 

in' 

46.55 

97.51 

9 

1642 

ND 

Cass 

20 

II 

46.44 

97.34 

9 

1645 

ND 

Traill 

20 

II 

47.33 

96.56 

9 

1648 

ND 

Recoman 

19 

III 

46.04 

103.06 

5 

1652 

ND 

Stark 

20 

III 

46.56 

102.50 . 

5 

1662 

ND 

Ransom 

19 

II 

46.24 

98.02 

9 

1681 

SD 

Roberts 

21 

III 

45.32 

96.49 

8 

1807 

SD 

Bonhomme 

18 

III 

43.03 

97.57 

6 . 

1852 

KS 

Lane 

11' 

II 

38.30 

100.26 

12 

1857 

KS 

Grant 

9 

II 

37.41 

101.08 

15 

1860 

KS 

Hodgeman 

11 

II 

38.08 

99.42 

‘ 13 

1865 

KS 

Stevens 

14 

II 

37.12 

101.09 

11 

1887 

KS 

Russel 

11 

II 

39.07 

98.55 

7 


265 




TABLE L-2. CATEGORY B SITES IN THE SMALL GRAINS DATA BASE 

Number of 


Site 

State 

County 

APU 

Phase 

Latitude 

Longitude 

Acquisitions 

1000 

CO 

Logan 

10 

III 

40.34 

102.54 

13 

1008 

CO 

Kit Carson 

10 

III 

39.33 

102.19 

5 

1049 

OK 

Texas 

9 

III 

'36.46 

101.20 

9 

1060 

TX 

Sherman 

9 

III 

36.22 

101.41 

4 

1102 

MT 

Yellowstone 

104 

III 

45.57 

108.20 

8 

1154 

KS 

Mitchell 

12 

II 

39.25 

98.00 

10 

1163 

KS 

Coffey 

8 

II 

38.15 

95.38 

5 

1165 

KS 

Lina ‘ 

14 

II 

38.10 

94.53 

12 

1167 

KS 

Shawnee 

’ 14 

II 

39.04 

95.43 

6 

1169 

KS 

Commanche 

60 

II 

37.00 

99.16 

6 

1179 

KS 

Butler 

7 

II 

37.35 

97.00 

4 

1181 

KS 

Cowley 

7 

II 

37.17 

96.51 

4 

1242 

OK 

Canadian 

7 

III 

35.27 

98.50 

6 

1355 

OK 

Beaver 

7 

III 

36.35 

100.00 

7 

1489 

SD 

Walworth 

19 

III 

45,29 

99.42 

3 

1498 

SD 

Codington 

16 

III 

44.57 

97.02 

8 

1513 

MN 

Kittson 

15 

III 

48.52 

97.06 

4 

1521 

MN 

Grant 

20 

III 

46.05 

96.01 

5 

1560 

NB 

Banner 

10 

III 

41.29 

104.00 

10 

1568 

NB 

Sheridan 

15 

III 

42.21 

102.41 

7 

1576 

NB 

Lancaster 

14 

III 

40.52 

96.50 

7 

1602 

ND 

Mountain 

21 

III 

48.21 

102.25 

6 

1614 

ND 

Pierce 

19 

II 

48.31 

100.10 

5 

1618 

ND 

Grand Forks 

20 

II 

47.44 

97.32 

10 

1661 

ND 

McIntosh 

21 

II - 

46.16 

99.45 

6 

1663 

ND 

Richland 

20 

III 

46.23 

96,44 

11 

1675 

SD 

McPherson 

21 

III 

45.56 

99.13 

5 
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TABLE L- 

-2 . CATEGORY 

B SITES 

IN THE 

SMALL GRAINS DATA BASE 

(Cont' d) 

Site 

State 

County 

APU 

Phase 

Latitude 

Longitude 

Number of 
Acquisitions 

1677 

SD 

Spinks 

19 

III 

45.04 

98.06 

7 

1699 

SD 

Hyde 

16 

III 

44.28 

99.27 • 

10 

1739 

MT 

Teton 

104 

III 

47.45 

111.30 

10 

1800 

SD 

Mi look 

19 

III 

43.41 

97.24 

7 

1851 

• KS 

Graham 

9 

II 

39,33 

99.57 

11 

1861 

KS 

Kearny 

9 

II 

38.12 

101.25 

13 

1886 

KS 

Rush 

8 

II • 

38.30 

99.20 

9 

1891 

KS 

Reno 

12 

II 

37.58 

98.25 

13 

1913 

ND 

Hettinger 

21 

III 

46.33 

102.47 

7 
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AA Subpixel 

Ground Truth Landsat Data in UNIVERSAL Format 

Data 



FIGURE L-1. FLOW OF PROCESSING CARRIED OUT FOR EACH SEGMENT 
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Pixel Data (Final for 
Category B Segments) 


FIGURE L-1, FLOW OF PROCESSING CARRIED OUT FOR EACH SEGMENT (Cont'd) 









OHALF Format 


FIGURE L~l. PLOW OF PROCESSING CARRIED OUT FOR EACH SEGMENT (Cont’d) 
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PFEA.T calculates features (haze diagnostics) in support of a spa- 
tially varying XSTAE. algorithm. 

XSTAR carries out a spatially varying haze correction on each 
acquisition. It also applies sun angle (cosine) and satellite cali- 
bration corrections to the data. 

TASCAP performs a linear transformation on, the data that for each 
acquisition replaces the four Landsat hands with four Tasseled Cap 
features . 

FLDS15 modifies ground truth codes referring to special fields, 
when necessary, so that they reflect the correct ground truth class 
of the fields. 

SUPERB, described in Appendix D, was used to identify quasi-fields 
that are spectrally homogeneous and spatially contiguous field patterns 
that obey existing ground truth boundaries. 

STRIP identifies pixels on quasi-field edges. A pixel is flagged 
as edge if any of its four strong neighbors belong to a different blob.. 

COMPRS computes and outputs spectral and spatial mean values for 
each quasi-field, as a compressed data set. This compressed data set 
can be used economically for development tasks that require many passes 
through the data. 

For each segment, two output data products are made part of the 
data base. These are pixel data and compressed data (field means). 
Category B segments are not processed through BLOB or COMPRS, and do 
not have compressed data available. For these segments, the final pixel 
data used in the data base is that resulting from XSTAR and TASCAP. It 
is in the same form as pixel data for Category A segments, except that 
channels reserved for BLOB and STRIP results are zeroed. 
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The pixel data for a segment consists of 22,932 pixels organized 
into 117 scan lines. It contains (16+5n) channels, where n is the 
number of Landsat acquisitions processed for the segment. The channel 
assignments, detailed in Table L-3, were designed to allow a variable 
number of acquisitions yet provide consistent locations for most 
channels . 

Due to the number of processing steps that transform the pixel 
data, it was necessary to avoid the possibility that significance 
error would creep in when numerical values are converted to integer 
form between steps. Thus, pixel values for some channels are maintained 
in' floating point form, both during processing and when stored in an 
external format. 

For this reason, and because the largest number of channels we 
used for one data set is 91, the data was stored in an ERIM-defined 
format, rather than in a more widely known format such as UNIVERSAL. 

The name of the format used for pixel data is OHALF. 

The second data product used in the data base is compressed data 
consisting of blob means produced by routine COMPRS. This product con- 
sists of one record for each blob, containing the channels described in 
Table L-4. The channel values within each record are stored in 4-byte 
binary floating point form. 
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TABLE L-3. CHANNEL ASSIGNMENTS FOR PIXEL DATA USED 
IN THE SMALL GRAINS DATA BASE' 


Channel 


1 


2-7 


8-9 

10 

11 

12-16 

17 

18 

19 • 

20 

(13+41) 

thru 

(16+41) 

(16+4n+l) 


Explanation 

Ground truth summary code (zero or equal to unanimous 
subpixel codes) 

Subpixel ground truth codes in the order: upper-left, 

middle-left, lower-left, upper-right, middle-right, 
lower-right) 

Blob number (is 256* chan 8 + chan 9) 

Blob ground truth code (or zero if ground truth is 
ambiguous or unknown) 

Strip- channel (1 if pixel is on edge of blob , 

0 otherwise) 

Reserved for future processing results 
Tasseled Cap Brightness component, acquisition 1 
Tasseled Cap Greenness component , acquisition 1 
Tasseled Cap Yellowstuff component, acquisition 1 
Tasseled Cap Nonesuch component, acquisition 1 

Tasseled Cap components for acquisition i in the 
order given above for i=l. 


Screen channel for acquisition i, given that there 
are n acquisitions . Zero means pixel is good in 
acquisition i. 
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TABLE L-4. CHANNEL ASSIGNMENTS FOR COMPRESSED (FIELD MEAN) 
DATA USED IN THE SMALL GRAINS DATA BASE 


Channel 


Explanation 


JC. 

1 '' 


Segment number 


A unique job identifier 


3 


Blob number 


4 

5 

6 
7 


8 

9 

10 

ll-(104n) 


Blob type. If 1.0, blob has interior pixels. 

If 2,0, blob has only edge pixels. 

Total number of pixels in blob 

Number of interior pixels 

Number of pixels used to form the mean (same as 
Channel 5 if blob has only edge pixels; same as 
Channel 6 if blob has some interior pixels) 

Blob ground truth code 

Midrange of line number coordinates for this blob 

Midrange of point number coordinates 

Acquisition date given as number of days since 
31 December 


(ll+n)-(10+2n) Spectral mean vector 


* 


Same for all blobs 




APPENDIX M 

NONPARAMETRIC CROP PROPORTION ESTIMATION 

Cover and Hart [61] proved an- asymptotic property of the nearest 
neighbor decision rule which raises the question of how this non— 
parametric classification technique can be extended to a nonparametric 
crop proportion estimation technique. This section explores relation- 
ships between classification and proportion estimation and proposes a 
proportion estimation procedure based on nearest neighbor classification. 

In order to reduce notation, we will restrict the . discussion to 
the two-class problem. The reader can think of them as wheat and non- 
wheat. Generalization to the multiclass problem offers no conceptual.’ 
difficulty. Accordingly, let lis assume a population with a multivariate 
mixture density 

£(x) = p£^(x) + Cl-p)f^(x) 

It is convenient to think of the two-dimensional case. Further, we assume 
a training sample of N points x^,x^, . . . ,Xj^ together Xirith the true classi- 
fication X. of the point x , 1 ^ i < N; i.e. , 

X = 1 if X. is from class 1 
XI 

X = 0 if X, is from class 0 
1 X 

The Nearest Neighbor Classifier 

Tne nearest neighbor (NN) rule classifies a sample point x as being 
from class 1 if the point closest (Euclidean distance) .to x.from among 
the X. is in class 1, and is classified as from class 0, otherwise, 
ilie probability of misclassiflcation for this classification scheme is 
denoted by R^^, and R is defined by 

R = llm 
N-xo 
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, Now suppose p, and were known. Then one could use the 

optimxain Bayes classification rule to minimize the probability of mis- 

* 

classification. Let this probability be denoted by R . The Cover and 

Hart result is that under very general conditions 

* * * 

R <, R ^ 2R (1-R ) 

(Of course the left side inequality is an .immediate consequence of the 
optimality of the Bayes classifier.) 


Classifiers and Proportion Estimation 

We digress here to discuss the relationship between classification 
and proportion estimation. Let us suppose we have a classifier such that 
the probability of correctly classifying a point in class 1 is a and the 
probability of correctly classifying a point in class 0 is 6 . If a and 6 
are known, then the minimum variance unbiased estimate p of the propor- 
tion p is 




- _ p - q-s) 

^ a+6-l 


t 



where p denotes the relative number of sample points classified into 
class 1. 


If a and 6- are unknown, then, as discussed in Section 6, no 
unbiased estimate of p exists and the mean square error of any estimate 
of p is bounded below, independently of sample size, with the bound 
depending on the ranges assumed' for a and 5 . Thus the extension of a 
classifier into a satisfactory proportion estimator depends upon obtaining 
adequate information about ct and 6 . 


factor 


The reader will recognize the NASA (Feiveson) bias correction 
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In order to obtain estimates a and 6 of the conditional proba- 
bilities of correct classifications a and 6 of the nearest neighbor 
classifier, we proceed as follows. For each i, 1 ^ i ^ N, let = 1 
if the point in the sequence x^,X 25 -*-j ^±+l’**‘*^N to 

is in class 1, and let = 0 otherwise. Then set 

X.=l X.=l 

X ' X 

and 

6 =[E (1- y.)] /E (i - V 

x,=o x.=o 

X X 

It appears reasonable to expect a and 6 to converge (in some sense) to 
a and 6 with increasing N. 

Now let ^ sample of data points without true class 

information available. We refer to the as the training data, and . 

the y^’s as the test data. We would like to estimate p using both the 
training data and the test data. Let , 1 ^ j ^ M, denote the class 
assigned to by the nearest neighbor rule. For estimating p, we have 
the information contained in the x. with their true classifications X., 

X . X 

the y^ with their assigned classifications , and the estimated conditional 
probabilities a and 6 of correct classification of the NN classifier. 
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If we had the training points with true class given, test 
points y j > s classifier of the test points, and conditional proba- 
bilities a and 6 of the classifier, then we could readily write down 
the likelihood for the M+N observations 


(^l>^l) s C^2 > * * • ’ 

assuming all these observations were Independent. The nearest neighbor 
classifier introduces dependencies between the (y.,V;)’s and the (x^,A^)'s. 
Furthermore, we only have estimates of ct and 6 available. Nevertheless, 
it seems reasonable to proceed to estimate p by using the maximum likeli- 
hood procedure for independent observations and exact values of ot and 6 
Instead of estimates. 

Accordingly, we define the estimate p of p to be the value of p which 
maximizes the function L(p) given by 

‘ + 6 - l)p + (1-6)] [6 - (a + 6 - l)p] ° 


- = N - N^- 


- V.) = M - M, 
3 1 


L(p) = p -(1-p) L(c 


where 


N 


N = r X, 
1 1-1 ^ 


N 


■No - z (1 
x=l 


M 


“i ■ 2 "i 
i=i ^ 


M 


Mo = z; (1 

3=1 
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Now the function ■ 

lnL(p) = lnp'+ NQln(l-p) + M^ln[a-hS-I)p + (1-6)] + MQln[6-(a+6-l)p] 

is concave (its second derivative with respect to p is negative) so that 
there is a unique maximum. Also, concavity facilitates numerical compu- 
tation of p. 

As of now we do not know if p possesses any useful properties. For 
example, what about mean-square error? Let 

Let p denote the minimum variance unbiased estimate of p based only on 
the N training data points and their associated classes. Let Var^^ denote 
the variance of p. If a + g 1, is it true that 


lim 

N-x» 

M=KN 


2 

^ N,M 

Varjj 


< 1 


9 


where lim denotes limit superior, and K is any fixed integer. In words: 
Does the test data help to estimate p in the large sample case? 


We now consider the form the analog of the Cover and Hart result 

for p might take. Suppose f^(x) and f^(x) are known. Then one can, 

in principle, obtain the maximum likelihood estimate of p based on the 

observations (x.,u.) and the y.. Let this estimate be denoted by p and 
_2 1 1 2 

let e be the mean-square error of this estimate. An analog of the 
Cover and Hart result would specify an upper bound for 


lim 

N-4<» 

M“KN 


2 

^ N,M 
-2 

^ N,M 
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APPENDIX K 

STATIC STRATIFIED SAMPLING ALLOCATION TECHNIQUES 

N,1 INTRODUCTION, ASSUMPTIONS, AND NOTATION 

In this Appendix, we assume that there are M strata which are 
defined only by spectral /temp oral variables. Thus all segments would 
have comparable strata. We also assume that the sampling targets are 
the same size and the ratio of the sample size to the population size 
is so small that we can approximate the sampling model with the model 
which samples with replacement. 

We denote the strata by S. for i=l,...,M; the size of the strata 
M ^ 

by N., N, = j; N^, and the strata sample sizes by n.. 
i=l 

N.2 SAMPLE ALLOCATION TOEN LABELS ARE CORRECT AND EACH STRATUM HAS 

A PRIOR DISTRIBUTION FOR P. 

1 

The Bayesian view of stratified sampling with prior distributions 
for the strata proportions Pi is as follows: The proportion of wheat 

in stratum i is a random variable with an unknown distribution Ti. 
Mother nature randomly chooses Pi according to ti then chooses Xi 
according to a binomial (n^, Pi) distribution, where Hi is the sample 
size of stratum i. 

We assume that we have a prior distribution for each stratum. 
These priors were obtained from past observations or from the experi- 
menter's intuition. The priors and outcomes of an experiment yield 
posterior distributions'. If the priors are "good" then the posterior 
becomes the best estimate of the unknown t^'s. The posterior contains 
all information from the prior and the experimental data. We wish to 
use the priors to allocate samples to the strata in order to minimize 
expected variance of the segment proportion estimate. 
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If the proportion of wheat labels in stratum i is and the sample 
size is n^, then the distribution of the number of wheat labels in 
the sample, has density 


f^(x|pp 




n.-x 

(l-P^) "• 


x==0,l,2, . 


,n^, 0 ^ :S 1 


If the density of the i^^ prior is g^(P), then the density of the joint 
distribution is 


h^(X,P) = f^(x[P)g^(P) 

and the density of the marginal distribution of X^ is 

f.(X) = /h(x,P)dP 


The posterior distribution of P^ given is 


h (X',P.) 

g,(P XJ = -> 


X 


f^(X) 


Often for convenience, it is assumed that 


g. (P) = P^(l-P)® 0<P<1, r>0, s>0 

This is called a beta (r,s) density. In this case is a 

beta (r+X^, s+n^-x^) density. 

■ We estimate the segment proportion 

M N, X. 

P= S 

i=l N n. 

• X 

And the ejipected variance of P is 
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= Jl i^fo S. (P)dP 


Thus, the allocation which minimizes V(P) with a fixed total sample 

•ortj 

N.\ 


size is to choose n^ proportional to 


m 


Jp(i-p)g^(p)dp ^ 


Example 1 ; Suppose that there are four strata, total sample size 
of 30. We assume that we have no prior information about 
and T^, so we set our prior densities as: 


g^(P) = 1 if 0 -< P < 1 
= 0 otherwise 


Suppose that N^-250, N2=150, W^=50, N^=50, and N=500. 
. minimize the expected variance we much have. 

.1 . 


N. r 

n. ~ I 

1 N. Jo 


. p(i-p)^p = ^ \ 


In order to 


which is consistent with [1] . In this case, it is possible to allocate 
exactly proportional r=n^ , thus n^=15, n^=3, and n^=3. The table 
gives the number of wheat labels observed. 


i. 

N. 

X 

n. 

1 

X. 

X 

1 

250 ■ 

15 

1 

2- 

150 

9 

■ 2 

3 

50 

3 

2 

4 

50 

_3 

3 


500 30 

We estimate P with 

P = ^ 4- + 50_ / 2 \ ^ /3 \ _ U 

- 500 \15/ 500 V 9/ 500 \ 3/ 500 45 
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and obtain the posteriors as described above, namely: 

g^(p|x^=l) = 2‘40P(1-P)^^ 

. §2 (P 1 X 2 = 2 ) = 360P^(1-P)^ 
g2(PiX^=2) = 12P^(1-P) 
g-^(p|X^=3) = 4p2 

The graph of these posteriors is given in Figure N-1. 

Example 2: Suppose that we have the same four strata definitions with 

another segment and we use the posteriors from Example 1 (view the 
segment in Example 1 as a training segment). In order to allocate, 
a sample of size 50 to these four strata, we give 

and n. 

as follows. 


'1 ^ , "i 


1 

200 


16 

(16.52) 

2 

150 

. (if) V? - 

17 

(16.87) 


100 


12 

(11.796) 

4 

- 50 


5 

(4.81) 


In the last section, when we assigned that the labels were all cor- 
rectly classified we could update our prior every time we sampled. In 
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FIGURE N-1. ILLUSTRATION OF POSTERIOR PROBABILITY DENSITIES 





this section, we assume that grain targets from stratum i are classified 
as grain with probability and non-grain targets are classified as 
non-grain with probability Unless we have ground truth for all of 

the targets in the sample, we cannot update our prior. 


We first assume that for some training segments we have ground 
truth for our sample so that we can construct a prior. On the first 
segment we set g^(P,a,g) = 1, (P,a,g)e[0,l]^, i=l,2,...,M. Let n^ 
denote the i^^ sample size, W^ denote the grain targets which are 
labeled grain, denote the grain targets, which are labeled npn-grain, 
Wj^' denote the non-grain targets which are labeled grain, and denote 
the non-grain targets which are labeled non-grain. The density of 
(W., Y W/, Y ) given (P , a , g ) is 


f .(W,Y^,W%Y P.,a.,g.) = 

1 - ' 1 1 X 


,W Y^^^W' Y)p.^^'a^(l-a.)-^'(l-P.)^''-*-^l-g.)-^''gJ 


where WY-'+W'+Y = n.. 

X 

The joint density of (W,, Y.', VI,', Y., P., a., g.) is 

•' X X X X X X X 

h^(W,Y%W%Y,P,a,g) = g^(P,a,g)f^(W,Y^W^YlP,a,g) 
The marginal density of (W^, • 


N. 


f Y W Y) - { ^ 1 WiY-lW-m 

,W Uj y.. yj (N.+l)! (W+Y-+1) (W^+Y+1) 


The computations are shown in Table N-1. 

The posterior of (P^, a^, g^) given (W^, ^i^’ ^i^ 
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TABLE N-1. COMPUTATION OF THE MARGINAL DENSITY OF (W. , Y' , W^, Y.) 

1 1 i 1 




.1 ^1 


f^(W,Y%W",Y) 

dP da d6 


fo jo fo („,-"^^jp'^’''»Vc.)^'(l-P)’''"^(l.-6)”VdPdad. 

^ ’"I ^ f \ ' 

Y' W" yJ Jo J ^ \ (n>l); )Jo \(W+Y0!(W'‘ 

' Hi \ (W+Y-)!(W-+Y)! r g.W- Y _.W!Y:!__ (W+Y^+l) ! W,. ^,Y 

W YMr y) ^ ^ |(W+Y+D! Jo W!Y"! 

' \ W!Y-1(W-+Y)! W' Y I 

W Y" W' y) (n^+1) ! (W+Y-+1) J ^ ^ 

\ W!Y'!(W"+Y)! ( W"!Y! f (W'+Y+l) ! ,, .^W^Y 

W Y' W" Yi (n.+l) ! (W+Y"+l) I (W'+Y+l) ! Jo W'.'Y! ^ 


f / •' • w+y' w +Y I 

L \ (W+Y-)KW-+Y)! " ‘'7 


W!Y"! 


a (1-a) da dB 


\ W!Y'!(W'+Y)! / W"!Y! 

W Y' W" Yj (n.+l) ! (W+y+1) I (W-'+Y+l) 


W Y' W-* Y 


W!Y"!W"!Y! 

(n^+1) ! (W+Y'+l) (W-+Y+1) 



^RIM 

Lmmi 


h.(W,Y%W',Y,P,a,g) 

S.(P,a,B|w,Y',H',Y) - (H,Y-.H'.Y) 


(n+l):(W-Y'+l)(W'+Y+l) 

P a (1-a) (1-P) (l-$) g 


w:y';w"!y: 


This posterior could be updated by still another experiment with label's 
and ground truth using similar methods as given above. 

In an experiment without ground truth, we only observe 
Y^+Y^", that is we only observe the number of grain labels and non-=-grain 
labels. Thus, it is not reasonable to update the priors. The variance 
of 


W.+W.^ 

P. = — — 

1 n . 


given (P , a , g ) is 


v(P ) = [Pa+(i-p)(i-B)] rpq-a) + q-p)g] 

^ -n. 

X 


Thus the variance of 


given (P,a,B) is 


_ N. _ 

= 11 xT P- 
N X 
x=l • . 


^i / N. 


vcp) =■ E iiri v(p.) 

i=l 


The stratified sampling allocation which minimizes EV(P) with a 
fixed total sample size is to choose n^ proportional to 




[Pa + (1-P)(1-B)] [P(l-a) + (l-P)B]g^(P,a,B) 


where the priors are the posterior from the training experiment. 
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N.4 USE OF BOUNDS ON P., a., AND B. IN STRATIFIED SAMPLE ALLOCATION ’ 

1 X. X 

We assume that we have oBtaxned from training segments the following 
hounds : 

a. > A. > B. > B. > i 
X— X 2 X— X 2 


and case (i) P . > Ct . > or 

i — 2 


(ii) P . < Gu. < y 
This gives the following results: 


V(P^) = 


[P a. + (l-P.)a-BJ] [P.(l-a >+ d-P.)B.] 

n. 

X 


pA^(l-a^) + P^(l-P^)(a^B^ + (l-a^)(l-B^)) + (l-P^)^B^(l-B^) 

n., 

X 

In case (i) we have 

A (1-A ) + Ct . (1-Cl ) (3-A -B )-+ (1-0^ )^B, (1-B. ) 

Y(p ) < _± ± i ± ^ ^ ^ ^ 1 


X — 


n. 

X 


and in case (ii) 

) Cu^-Vci-A.) + Cu.(l-Cu.)(3-A.~B.) + B.(l-B.) 

^ — - n . 

X 

In either, case, denote these upper bounds on V(P^) as Thus, 

the variance of 


m N. 


■ P = y p 

•S N i 
x=l 


.is as follows 
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M 


N.\2 D. 2 


M /i^T \2 

v(p) - L (#■; v(p.) < lif-) ^ 

■ 1=1 • 1 = 1 ' •' 1 


In this case we allocate the samples to the strata proportional to 


N, 

1 

^ D- 

N X 

Example 3. Suppose we have four strata with 8, 9, a^> ^7 . 


3^>.9, ^3-'®’ ^4-*^^= 

N2=200, N2=150, and N^=100. Values 

, 1 * 2 -'^'’ ^3--^’ ^ 4 !-^ 

for N., A., B., C,, and 

11-11 i 

, N^=250 
are: 


N. A, 

1 1 

B. Cr. 

1 ^1 

Cn. D . 

^1 1 

N'.D. 
1 1 

1 

250 .8 

.9 

.1 /720^ 

114.182 

2 

200 .9 

,95 

,4 / 7 J 379 

116.258 

3 

150 .7 

.8 ' .6 

/i5956 

115.763 

4 

100 .9 

.99 .8 

/. 267996 

.51.768 

Thus, 

if we have -a 

total sample size 

of 50, then 




= 14 

(14.35) 




U2 = 15 

(14.61) 




"3 = 

(14.54) 




U4 = 6 

(6.50) 
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